Introduction


This unit is the first of three that together introduce the fundamentally 
important topic of calculus. Calculus provides a way of solving many 
mathematical problems that can’t be solved using algebra alone. It’s the 
basis of essential mathematical models in areas such as science, 
engineering, economics and medicine, and is a fascinating topic in its own 
right. 


In essence, calculus allows you to work with situations where a quantity is 
continuously changing and its rate of change isn’t necessarily constant. As 
a simple example, imagine a man walking along a straight path, as shown 
in Figure 1. His displacement from his starting point changes continuously 
as he walks. As you’ve seen, the rate of change of displacement is called 
velocity. 
Figure 1 A man’s displacement along a path


Since the man’s displacement and velocity are along a straight line, they 
are one-dimensional vectors, and hence can be represented by scalars, with 
direction along the line indicated by the signs (plus or minus) of the 
scalars, as in Subsection 2.4 of Unit 2, and as mentioned near the start of 
Subsection 5.1 of Unit 5. In the calculus units in this module, we’ll work 
with displacement and velocity only along straight lines, and hence we'll 
always represent these quantities by scalars. 


If the man’s velocity is constant, then it’s straightforward to work with the 
relationship between the time that he’s been walking and his displacement 
from his starting point. As you saw in Unit 2, you just use the equation 


displacement = constant velocity × time.


For example, if the man walks at a constant velocity of 6 kilometres per 
hour, then after two hours his displacement is 


(6 × 2) km = 12 km.


However, suppose that the man doesn’t walk at a constant velocity. For 
example, he might become more tired as he walks, and hence gradually 
slow down. Then the relationship between the time that he’s been walking 
and his displacement is more complicated. Calculus allows you to deal 
with relationships like this. 
Calculus applies to all situations where one quantity changes smoothly
with respect to another quantity. In the example above, the displacement 
of the walking man changes with respect to the time that he’s been 
walking. Similarly, the temperature in a metal rod with one end near a 
heat source changes with respect to the distance along the rod. Similarly 
again, the air pressure at a point in the Earth’s atmosphere changes with 
respect to the height of the point above the Earth’s surface, the 
concentration of a prescription drug in a patient’s bloodstream changes 
with respect to the time since the drug was administered, and the total 
cost of manufacturing many copies of a product changes with respect to 
the quantity of the product manufactured. You can see from the diversity 
of these examples just how widely applicable calculus is. 


Basic calculus splits into two halves, known as differential calculus and 
integral calculus. (In this context ‘integral’ is pronounced with the 
emphasis on the ‘int’ rather than on the ‘eg’.) Roughly speaking, in 
differential calculus, you start off knowing the values taken by a changing 
quantity throughout a period of change, and you use this information to 
find the values taken by the rate of change of the quantity throughout the 
same period. For example, suppose that you’re interested in modelling the 
man’s walk as he gradually slows down. You might have worked out a 
formula that expresses his displacement at any moment during his walk in 
terms of the time that he’s been walking. Differential calculus allows you 
to use this information to deduce his velocity (his rate of change of 
displacement) at any moment during his walk. 


In integral calculus, you carry out the opposite process to differential 
calculus. For example, suppose that you’ve modelled the man’s walk by 
finding a formula that expresses his velocity at any moment during his 
walk in terms of the time that he’s been walking. Integral calculus allows 
you to use this information to deduce his displacement at any moment 
during his walk. 


Surprisingly, you can also use integral calculus to solve some types of
problem that at first sight seem to have little connection with rates of
change. For example, you can use it to find the exact area of a shape
whose boundary is a curve or is made up of several curves.


This unit, together with the first half of Unit 7, introduces differential 
calculus, while the second half of Unit 7, together with Unit 8, introduces 
integral calculus. You’ll also use calculus in Unit 11, Taylor polynomials. 


The name ‘calculus’ is actually a shortened version of the historical 
name given to the subject, which is ‘the calculus of infinitesimals’. 
The word ‘calculus’ just means a system of calculation. The word 
comes from Latin, in which ‘calculus’ means ‘stone’ — the link is in 
the use of stones for counting. 


The calculus of infinitesimals became so overwhelmingly important 
compared to other types of calculus that the word ‘calculus’, used 
alone, is now always understood to refer to it. 


An ‘infinitesimal’ was regarded as an infinitely small part of 
something. When you consider an object’s velocity, for example, in 
the calculus of infinitesimals, you don’t consider its average velocity 
over some period of time, but rather its ‘instantaneous velocity’ — the 
velocity that it has during an infinitely small interval of time. 


Although the ideas behind calculus are explained in quite a lot of detail in 
this unit, and in the later calculus units in the module, many of the 
explanations aren’t as mathematically precise and rigorous as it’s possible 
to make them, and the proofs of some results and formulas aren’t given at 
all. For example, the idea of a limit of an expression is introduced, but this 
idea is described in an intuitive way: the unit doesn’t give a precise 
mathematical definition. 


The reason for this is that it’s quite complicated to make the ideas of 
calculus absolutely precise. At this stage in your studies it’s not 
appropriate for you to learn about how this can be done, because the 
necessary profusion of small details would make it harder for you to 
understand the main ideas. Instead, the precise mathematical ideas behind 
calculus are covered in the subject area known as real analysis. You might 
study this subject at Level 2, depending on your chosen study programme. 


The fundamental ideas of calculus were developed in the 1600s, 
independently by Isaac Newton in England and Gottfried Wilhelm 
Leibniz in Germany. (‘Leibniz’ is pronounced as ‘Libe-nits’.) Neither 
Newton nor Leibniz made their ideas rigorous — this work was done 
later by other mathematicians. There’s more about Newton and 
Leibniz later in the unit. 
1 What is differentiation?


This first section introduces you to the fundamental ideas of differential 
calculus. 


1.1 Graphs of relationships 


Throughout Units 6, 7 and 8, you'll learn the ideas of basic calculus by 
thinking in terms of graphs. As you’ve seen, the relationship between two 
quantities, such as the time that a man has been walking and his 
displacement, can be represented by a graph. For example, the two 
displacement–time graphs in Figure 2 represent a man’s walk along a
straight path, on two different occasions.

Figure 2 Displacement–time graphs for a man walking along a straight
path (a) with constant velocity (b) with decreasing velocity


As you can see, the first graph is a straight line. You saw in Unit 2 that if 
a graph representing the relationship between two quantities is a straight 
line, then the quantity on the vertical axis is changing at a constant rate 
with respect to the quantity on the horizontal axis, and the gradient of the 
graph is this constant rate of change. So the first graph represents a walk 
in which the man’s rate of change of displacement with respect to time, 
that is, his velocity, is constant. The gradient of the graph is his constant
velocity: you can see that it is about 6 km h⁻¹.


The second graph is curved. This graph represents a walk in which the 
man gradually slows down as time goes on — you can see that the longer 
he’s been walking, the less his displacement changes in a given period of 
time. 


Activity 1 Working with a curved displacement-time graph 


Estimate from the graph in Figure 2(b) roughly how far the man walks in 
the first half-hour and in the final half-hour of his walk. 


If a graph that represents the relationship between two quantities is curved, 
then there’s no single gradient value that applies to the whole graph. 
However, the graph has a gradient at each point on the graph — you’ll learn 
in the next subsection how this is defined. As with a straight-line graph, 
the gradient at each point is the rate of change of the quantity on the 
vertical axis with respect to the quantity on the horizontal axis; the only 
difference is that it takes different values at different points. For example, 
in Figure 2(b), the gradient of the graph at each point is the man’s 
velocity at a moment in time, and it gradually decreases as time goes on. 


So in this unit you’ll begin your study of calculus by looking at how the 
idea of gradient applies to curved graphs. 


All the relationships between two variables that you’ll consider in the 
calculus units in this module are those in which one variable is a function 
of the other. (For instance, in the example of the man’s walk, his 
displacement is a function of the time that he’s been walking.) So, in every 
graph in these units, every value on the horizontal axis corresponds to at 
most one value on the vertical axis. As in Unit 3, we’ll use the word 
‘function’ to mean ‘real function’; that is, a function whose input and 
output values are real numbers. This is the only type of function that we’ll 
deal with in the calculus units in this module. 


1.2 Gradients of curved graphs 


As mentioned above, in this section you'll learn about gradients of curved 
graphs. We’ll begin with a quick reminder about gradients of straight 
lines, as these ideas will be crucial in what follows. 


As you saw in Unit 2, the gradient of a straight line is a measure of how 
steep it is. To calculate the gradient of a straight line, you choose any two 
points on it and find the run and the rise from the first point to the 
second point. The run is the change in the x-coordinates, and the rise is 
the change in the y-coordinates, as illustrated in Figure 3. Then 

\[ \text{gradient} = \frac{\text{rise}}{\text{run}}. \]

The run and the rise from one point to another on a straight line can be 
positive, negative or zero, depending on whether the relevant coordinates 
increase, decrease or stay the same from the first point to the second point. 
If the run is zero, which happens when the line is vertical, then the 
gradient of the line is undefined, since division by zero isn’t possible. 
Vertical lines are the only straight lines that don’t have gradients. 
Figure 3 The run and the rise between two points on a straight-line graph
If the two points that you choose to calculate the gradient of a line are
(x₁, y₁) and (x₂, y₂), as illustrated in Figure 4, then


run = x₂ − x₁ and rise = y₂ − y₁.
Figure 4 The run and the rise in terms of the coordinates of the two
points 


These equations give the formula below. 


Gradient of a straight line 

The gradient of the straight line through the points (x₁, y₁) and
(x₂, y₂), where x₁ ≠ x₂, is given by

\[ \text{gradient} = \frac{\text{rise}}{\text{run}} = \frac{y_2 - y_1}{x_2 - x_1}. \]


Figure 5 The straight line through the points (1, 8) and (5, 2)
Remember that when you use this formula to calculate the gradient of a
straight line, it doesn’t matter which point you take to be the first point,
(x₁, y₁), and which you take to be the second point, (x₂, y₂), as you get the
same result either way. For example, using the formula to calculate the
gradient of the line through the points (1, 8) and (5, 2), which is shown in
Figure 5, gives either

\[ \text{gradient} = \frac{\text{rise}}{\text{run}} = \frac{2 - 8}{5 - 1} = \frac{-6}{4} = -\frac{3}{2} \]

or

\[ \text{gradient} = \frac{\text{rise}}{\text{run}} = \frac{8 - 2}{1 - 5} = \frac{6}{-4} = -\frac{3}{2}. \]
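The gradient formula, and the fact that the order of the two points doesn't matter, can be checked with a short Python sketch. This is only an illustration: the function name `gradient` and the use of `Fraction` for exact arithmetic are choices made here, not part of the unit.

```python
from fractions import Fraction

def gradient(p1, p2):
    """Gradient of the straight line through the points p1 and p2.

    Each point is a pair (x, y). A vertical line has no gradient,
    so a zero run is reported as an error.
    """
    (x1, y1), (x2, y2) = p1, p2
    run = x2 - x1
    rise = y2 - y1
    if run == 0:
        raise ValueError("vertical line: the gradient is undefined")
    return Fraction(rise) / Fraction(run)

# Either ordering of the two points gives the same gradient.
print(gradient((1, 8), (5, 2)))  # -3/2
print(gradient((5, 2), (1, 8)))  # -3/2
```

Swapping the points changes the sign of both the rise and the run, so the quotient is unchanged.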


You saw the following facts in Unit 2. A straight-line graph that slopes 
down from left to right has a negative gradient, one that’s horizontal has a 
gradient of zero, and one that slopes up from left to right has a positive 
gradient. The steeper the graph, the larger the magnitude of the gradient. 


Activity 2 Relating gradients to graphs 


Match each of the following descriptions to the appropriate graph. 
(a) Large negative gradient (b) Small negative gradient
(c) Zero gradient (d) Small positive gradient
(e) Large positive gradient


Now let’s look at how the idea of gradient applies to curved graphs. For 
example, consider the graph of the equation y = x², which is shown in
Figure 6.


Figure 6 The graph of y = x²


This graph has no single gradient value, since it’s not a straight line, but if 
you choose any particular point on it, then it has a gradient at that point. 
Figure 8 A point on a straight-line graph

To understand what’s meant by the gradient of a curved graph at a
particular point, consider the point on the graph of y = x² marked in
Figure 7(a). Imagine that you’re tracing your pen tip along the graph, but 
when it reaches the marked point you just carry on moving it in the 
direction in which it’s been moving, instead of following the graph. Then it 
will move along the straight line drawn in Figure 7(b). No matter whether 
you trace your pen tip along the graph towards the point from the left or 
the right, you’ll end up moving it along the same straight line, just in 
different directions. This straight line ‘just touches’ the graph at the 
marked point, and is called the tangent to the graph at that point. 


Figure 7 (a) A particular point on the graph of y = x² (b) the tangent to
the graph of y = x² at that point


The word ‘tangent’ comes from the Latin word ‘tangere’, which 
means ‘to touch’. The English word ‘tangible’, which means ‘capable 
of being touched’, comes from the same Latin word. You might 
remember from Unit 1 that the word ‘integer’ also comes from this 
Latin word (an integer is a whole, or ‘untouched’, number). 


The tangent to any curved graph at a particular point can be defined in a 
similar way. Because the tangent to a graph at a point has the same 
steepness as the graph at that point, we make the following definition. 


The gradient of a graph at a particular point is the gradient of the 
tangent to the graph at that point. 


Although this definition is made with curved graphs in mind, it also 
applies to straight-line graphs. To see this, consider a point on a 
straight-line graph, as illustrated in Figure 8. If you trace your pen tip 
along the graph towards the point and continue moving it in the same 
direction when you get to the point, then it will just continue moving 
along the graph. So the tangent to the graph at that point is just the
straight-line graph itself, and hence the gradient of the graph at the point
is just the gradient of the whole straight line, as you’d expect. 


It’s possible for a graph to have no gradient at a particular point. For 
example, consider the graph of y = |x|, which is shown in Figure 9(a). 
Imagine tracing your pen tip along this graph towards the origin, and 
continuing to move it in the same direction when you get to the origin. If 
you trace it along the graph towards the origin from the left, then you’ll 
end up moving it along a different straight line than if you trace it along 
the graph towards the origin from the right. So there’s no tangent to the 
graph at the origin, and hence the graph doesn’t have a gradient at this 
point. In general, if a graph has a ‘sharp corner’ at a point, then it has no 
gradient at that point. 
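One way to see the sharp corner numerically is to apply the two-point gradient formula to the origin and a nearby point on the graph of y = |x|. The sketch below is illustrative only; the helper name `line_gradient_from_origin` is an assumption made here, not notation from the unit.

```python
from fractions import Fraction

def line_gradient_from_origin(x):
    """Gradient of the line through (0, 0) and (x, |x|), for x != 0."""
    if x == 0:
        raise ValueError("the second point must differ from the origin")
    rise = abs(x) - 0
    run = x - 0
    return Fraction(rise) / Fraction(run)

offsets = [Fraction(1, 10), Fraction(1, 100), Fraction(1, 1000)]
from_right = [line_gradient_from_origin(x) for x in offsets]
from_left = [line_gradient_from_origin(-x) for x in offsets]

print(from_right)  # every value is 1
print(from_left)   # every value is -1
```

However close the second point is to the origin, the lines on the right all have gradient 1 and the lines on the left all have gradient −1, so there is no single line that both sides approach.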


As another example, consider the graph in Figure 9(b). You can trace your 
pen tip along the graph to the origin from the left, but you can’t do the 
same from the right, because you can’t reach the origin that way. So this 
graph has no tangent at the origin, and hence has no gradient at the 
origin. In general, if a graph has a ‘break’ (known as a discontinuity) at 
a particular point, then the graph has no tangent, and hence no gradient, 
at this point. 


As a third example, consider the graph in Figure 9(c). It has a tangent at
the origin, but the tangent is vertical, and hence itself has no gradient. So 
this graph has no gradient at the origin. In general, if a graph has a 
vertical tangent at a point, then the graph has no gradient at that point. 


Figure 9 Graphs with points at which there is no gradient 


Now let’s think about how you could find the gradient of a curved graph at 
a particular point at which it does have a gradient. An obvious thing to do 
would be to draw the tangent at that point, choose two points on it and
use them to calculate the gradient in the usual way, but this wouldn’t be a
very accurate method, as it’s difficult to draw a tangent accurately by eye.


The next subsection shows you a better method. It considers the
particular example of the graph of y = x².
1.3 Gradients of the graph of y = x²


The graph of y = x² has a gradient at every point, because it has a
tangent, which is not vertical, at every point. 


Let’s try to find the gradient of this graph at the point (1,1), which is 
shown in Figure 10. That is, we want to find the gradient of the tangent to 
the graph at this point, which is also shown in Figure 10. 
Figure 10 The point (1, 1) on the graph of y = x² and the tangent at
this point 


The way to find the gradient at (1,1) is to begin by thinking about how 
you could find an approximate value for this gradient. Here’s how you can 
do that. You choose a second point on the graph of y = x², fairly close to
(1,1), as illustrated in blue in Figure 11. The straight line that passes 
through both (1,1) and the second point is an approximation for the 
tangent to the graph at (1,1). So the gradient of this line, which you can 
calculate using the two points on the line in the usual way, is an 
approximation for the gradient of the graph at (1,1). 


Figure 11 An approximation to the tangent to the graph of y = x² at
(1,1) 

The closer to (1,1) you choose the second point to be, the better the 
approximation will be. For example, the second point in Figure 12 will 
give a better approximation than the second point in Figure 11. 


Figure 12 Another approximation to the tangent to the graph of y = x²
at (1,1) 


As an example, let’s calculate the approximation to the gradient at (1, 1) 
that you get by choosing the second point to be the point with 
x-coordinate 1.1, which is the point shown in blue in Figure 12. Since the
equation of the graph is y = x², the y-coordinate of this second point is
1.1² = 1.21. So the second point is (1.1, 1.21). The gradient of the line
through (1, 1) and (1.1, 1.21) is

\[ \frac{\text{rise}}{\text{run}} = \frac{1.21 - 1}{1.1 - 1} = \frac{0.21}{0.1} = 2.1. \]

So an approximate value for the gradient of the graph at (1,1) is 2.1. 


The second point that you choose on the graph can lie either to the left or 
to the right of (1,1). In the next activity, you’re asked to calculate the 
approximation to the gradient that you get by choosing a second point on 
the graph that lies to the left of (1,1), as illustrated in Figure 13. 


Figure 13 Yet another approximation to the tangent to the graph of
y = x² at (1, 1)

Activity 3 Calculating an approximation to the gradient at a point


Calculate the approximation to the gradient of the graph of y = x² at the
point with x-coordinate 1 that you get by taking the second point on the 
graph to be the point with x-coordinate 0.9. 


Notice that although you can take the second point to be as close to (1, 1) 
as you like, either on the left or on the right, you can’t take it to be 
actually equal to (1,1). This is because you can’t apply the formula for 
gradient to the points (1,1) and (1,1), since that would involve division by 
zero, which is meaningless. 


In the next activity, you can use a computer to see the gradients of some 
more lines that pass through (1,1) and a second point on the graph close 
to (1,1), calculated by using the coordinates of the two points. 


Activity 4 Calculating more approximations to the gradient at a 
point 


Open the applet Approximations to the gradient at a point. Make sure that 
the function is set to f(x) = x² and that the original point on its graph is
set to the point with x-coordinate 1. Move the second point closer and 
closer to the original point, and observe the gradients of the resulting lines, 
which are calculated by the computer using the coordinates of the two 
points. What do you think is the gradient of the tangent at (1,1)? 


Table 1 gives the gradients of some of the lines that you might have seen in 
Activity 4. 


Table 1 The gradients of lines through (1, 1) and a second point close to
(1, 1) on the graph of y = x²


x-coordinate of second point   0.8   0.9   0.99   0.999   1.001   1.01   1.1   1.2
Gradient of line               1.8   1.9   1.99   1.999   2.001   2.01   2.1   2.2


From Activity 4 and Table 1, it looks as if the closer the second point on 
the graph is to (1,1), the closer the gradient of the line through (1,1) and 
the second point is to 2. So it looks as if the gradient of the graph at (1,1) 
might be exactly 2. 
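The values in Table 1 can be reproduced with a few lines of Python. The following is a sketch for illustration, not the applet used in Activity 4; the names `f` and `line_gradient` are chosen here, and `Fraction` is used so that the arithmetic is exact rather than subject to floating-point rounding.

```python
from fractions import Fraction

def f(x):
    return x ** 2

def line_gradient(x_second):
    """Gradient of the line through (1, 1) and (x_second, f(x_second))."""
    x_first = Fraction(1)
    return (f(x_second) - f(x_first)) / (x_second - x_first)

# x-coordinates of the second point, as listed in Table 1.
x_values = ["0.8", "0.9", "0.99", "0.999", "1.001", "1.01", "1.1", "1.2"]

for text in x_values:
    g = line_gradient(Fraction(text))
    print(f"second point at x = {text}: gradient = {float(g)}")
```

Adding values such as "0.9999" or "1.0001" to the list gives gradients that are closer still to 2.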


Neither Activity 4 nor Table 1 shows this for certain, however. For 
example, Table 1 doesn’t rule out the possibility that the gradient at (1,1) 
might be some other number between 1.999 and 2.001. However, you can 
confirm that the gradient is exactly 2 by using an algebraic version of the 
method above. 


Let’s use the variable h to denote the increase in the x-coordinate from the 
original point (1,1) to the second point. The value of h can be either 
positive or negative, but not zero, since the second point can be either to 
the right or to the left of (1,1), but can’t be (1,1) itself. The x-coordinate 
of the second point is 1 + h, so, since the equation of the graph is y = x²,
the y-coordinate of the second point is (1 + h)². These coordinates are
shown in Figure 14.


Figure 14 The points (1, 1) and (1 + h, (1 + h)²) on the graph of y = x²,
and the line through them 


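The gradient of the line that passes through the two points is

\[ \frac{\text{rise}}{\text{run}} = \frac{(1+h)^2 - 1}{(1+h) - 1}. \]

This expression can be simplified as follows:

\[
\begin{aligned}
\text{gradient} &= \frac{(1+h)^2 - 1}{(1+h) - 1} \\
&= \frac{1 + 2h + h^2 - 1}{1 + h - 1} \\
&= \frac{2h + h^2}{h} \\
&= \frac{h(2 + h)}{h} \\
&= 2 + h.
\end{aligned}
\]
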


So the gradient of the line that passes through (1, 1) and (1 + h, (1 + h)²)
is given by the expression 2 + h. (You can see examples of this in Table 1.) 


Now think about what happens to this gradient as the second point gets 
closer and closer to (1,1). That is, think about what happens as the value 
of h gets closer and closer to zero. 


As the value of h gets closer and closer to zero, the gradient of the line, 
2 + h, gets closer and closer to 2. So the gradient of the graph at (1,1) 
must indeed be exactly 2, as we expected. 
The fact that, as h gets closer and closer to zero, the value of the expression
2 + h gets closer and closer to 2 is expressed mathematically by saying that


2 + h tends to 2 as h tends to 0,

or


the limit of 2 + h as h tends to zero is 2.


So we’ve now succeeded in finding the gradient of the graph of y = x² at
the point (1,1). You could use the same algebraic method to find the 
gradient of the graph at any other point. 


A cleverer thing to do, however, is to use the same method to find the 
gradient of the graph of y = x² at a general point on the graph, whose
x-coordinate is denoted by x. This will give you a formula for the gradient 
of the graph at any point, in terms of the x-coordinate of the point. You 
can then find the gradient of the graph at any particular point by just 
substituting into the formula, instead of having to go through the algebraic 
method again. 


So let’s now apply the algebraic method to find the gradient of the graph 
of y = x² at the general point whose x-coordinate is denoted by x. The
situation is shown in Figure 15. The point at which we want to find the
gradient is (x, x²). The second point on the graph is (x + h, (x + h)²),
where h is a positive or negative number, but not zero.

Figure 15 The points (x, x²) and (x + h, (x + h)²) on the graph of
y = x², and the line through them

The gradient of the line that passes through the two points is

\[ \frac{\text{rise}}{\text{run}} = \frac{(x+h)^2 - x^2}{(x+h) - x}. \]
This expression can be simplified as follows:

\[
\begin{aligned}
\text{gradient} &= \frac{(x+h)^2 - x^2}{(x+h) - x} \\
&= \frac{x^2 + 2xh + h^2 - x^2}{x + h - x} \\
&= \frac{2xh + h^2}{h} \\
&= \frac{h(2x + h)}{h} \\
&= 2x + h.
\end{aligned}
\]


As the second point gets closer and closer to the original point, the value 
of h gets closer and closer to 0, and the gradient of the line, 2x + h, gets 
closer and closer to 2x. So the gradient of the graph of y = x² at the point
whose x-coordinate is denoted by x is given by the formula


gradient = 2x. (1)


For example, this formula tells you that at the point with x-coordinate 1,
gradient = 2 × 1 = 2,
as we found earlier. 
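The simplification to 2x + h can be checked with exact arithmetic. The sketch below is an illustration under assumptions made here (the function name `line_gradient` and the sample values of x and h are arbitrary choices); it confirms that the gradient of the line through the two points equals 2x + h, so that its distance from 2x is exactly the size of h.

```python
from fractions import Fraction

def line_gradient(x, h):
    """Gradient of the line through (x, x^2) and (x + h, (x + h)^2)."""
    if h == 0:
        raise ValueError("h must be non-zero")
    return ((x + h) ** 2 - x ** 2) / Fraction(h)

sample_h = [Fraction(1, 10), Fraction(-1, 10), Fraction(1, 1000), Fraction(-1, 1000)]

for x in [Fraction(1), Fraction(2)]:
    for h in sample_h:
        g = line_gradient(x, h)
        # The line gradient agrees exactly with the simplified expression 2x + h,
        # so it differs from 2x by exactly |h|.
        assert g == 2 * x + h
        assert abs(g - 2 * x) == abs(h)
        print(f"x = {x}, h = {h}: gradient of line = {g}")
```

Because the difference from 2x is exactly |h|, making h smaller, whether positive or negative, brings the line gradient as close to 2x as you like.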


In the next activity you’re asked to use formula (1) to find the gradient of 
the graph of y = x² at two more points.


Activity 5 Finding the gradients of the graph of y = x² at particular
points


Use formula (1) to find the gradient of the graph of y = x² at the point
with x-coordinate 3, and at the point with x-coordinate −1.5.


Figure 16 shows the tangents to the graph of y = x² at the points with
x-coordinates 3 and −1.5. You can see, by estimating the rise and run
between two points on each tangent, that the gradients of these tangents 
do seem to be at least roughly the values calculated in the solution to 
Activity 5. 


In general, if you have the equation of a graph, then it’s often possible to 
use an algebraic method similar to the one that you’ve seen in this 
subsection to find a formula for the gradient of the graph. You'll see some 
more examples in the next subsection. 


Figure 16 Two tangents to the graph of y = x²

1.4 Derivatives


In Subsection 1.3 you saw how to find a formula for the gradients of the 
graph of y = x². In this subsection you'll see how you can use the same
method to find formulas for the gradients of some other graphs. Before you 
do that, it’s useful for you to learn some terminology and notation that are 
used when working with such formulas. 


Consider a function f and a particular input value x. The point on the
graph of f that corresponds to the input value x is (x, f(x)). If the graph
of f has a gradient at the point (x, f(x)), then we say that f is
differentiable at x. For example, you’ve seen that the function f(x) = x²
is differentiable at every value of x. If the graph of f doesn’t have a
gradient at the point (x, f(x)) (because the graph has no tangent at that
point, or because the tangent is vertical), then f isn’t differentiable at x.
For example, you’ve seen that the function f(x) = |x| isn’t differentiable
at 0. Similarly, if a function f isn’t even defined at a particular input
value x, then it’s not differentiable at x.


You’ve seen that, usually, the gradient of the graph of a function f varies 
depending on which value of x you’re considering. It’s convenient to think 
of these gradients as defining a new function, related to f. The rule of the 
new function is: 


if the input value is x, then the output value is 
the gradient of the graph of f at the point (x, f(x)). 


This new function is called the derivative (or derived function) of the 
function f, and is denoted by f’ (which is read as ‘f prime’ or ‘f dash’ or 
‘f dashed’). The domain of the derivative consists of all the values at 
which f is differentiable. The process of finding the derivative of a given 
function f is called differentiation, and when we carry out this process, 
we say that we’re differentiating the function f. 


For example, in the previous subsection we differentiated the function 
f(x) = x², and we found that its derivative has the rule f'(x) = 2x.


The word ‘derivative’ can be applied to expressions, as well as to functions. 
For example, rather than saying that 


the derivative of f(x) = x² is f'(x) = 2x,
you can simply say that


the derivative of x² is 2x.


The word ‘derivative’ is also used with a slightly different meaning. The 
value of the derivative of a function f at a particular input value x is 
called the derivative of f at x. For example, you saw in the last 
subsection that the derivative of the function f(x) = x² at 1 is f'(1) = 2.


The notation for derivatives can of course be used with letters other than 
the standard ones, f for the function and x for the input variable. For 
example, if the function g is given by g(t) = t², then the gradient of the
graph of g at the point (t, g(t)) is given by the formula 2t, and we express
this by writing


g'(t) = 2t.


You should now be ready to see some more examples of finding formulas 
for the gradients of graphs — that is, formulas for derivatives. We’ll begin 
by setting out the general method that can be used to find such formulas. 
This is just the algebraic method that you saw in Subsection 1.3, where 
the formula for the derivative of the function f(x) = x² was found. It’s
known as differentiation from first principles. 


Differentiation from first principles 


Suppose that f is any function. Let x denote any value in the domain of f
such that f is differentiable at x (that is, such that the graph of f has a
gradient at the point (x, f(x))), as illustrated in Figure 17. Now consider a
second point on the graph, with coordinates (x + h, f(x + h)), where h is a
positive or negative number, but not zero.


Figure 17 The points (x, f(x)) and (x + h, f(x + h)) on the graph of
y = f(x), and the line through them 


The gradient of the line that passes through the two points is

\[ \frac{\text{rise}}{\text{run}} = \frac{f(x+h) - f(x)}{(x+h) - x}, \]

and this expression can be simplified slightly to give

\[ \frac{f(x+h) - f(x)}{h}. \]


This expression is known as the difference quotient for the function f at 
the value x. As the second point (x + h, f(x + h)) gets closer and closer to 
the first point (x, f(x)), that is, as the value of h gets closer and closer to 
zero, the value of the difference quotient gets closer and closer to the 
gradient of the graph at the point (x, f(x)). That is, it gets closer and 
closer to f'(x). (Remember that the value of h can never actually be 0.) 
So, to find a formula for f'(x), you need to consider what happens to the
difference quotient for f at x, as h gets closer and closer to zero, taking
either positive or negative values as it does so. In other words, you need to
find, in terms of x, the limit of the difference quotient as h tends to zero.
You saw this procedure carried out for the function f(x) = x² in the last
subsection, and in the next example you'll see it carried out for the
function f(x) = x³.


Example 1 Differentiating from first principles 


Differentiate from first principles the function f(x) = x^3.


Solution

(Write down the difference quotient and use the fact that f(x) = x^3.)

The difference quotient for the function f(x) = x^3 at x is

\[ \frac{f(x+h) - f(x)}{h} = \frac{(x+h)^3 - x^3}{h}. \]

(Simplify the difference quotient. Start by multiplying out the term
(x + h)^3 in the numerator.)

Multiplying out (x + h)^3 gives

\begin{align*}
(x+h)^3 &= (x+h)(x+h)^2 \\
        &= (x+h)(x^2 + 2xh + h^2) \\
        &= x(x^2 + 2xh + h^2) + h(x^2 + 2xh + h^2) \\
        &= x^3 + 2x^2h + xh^2 + x^2h + 2xh^2 + h^3 \\
        &= x^3 + 3x^2h + 3xh^2 + h^3.
\end{align*}

So

\begin{align*}
\frac{f(x+h) - f(x)}{h} &= \frac{x^3 + 3x^2h + 3xh^2 + h^3 - x^3}{h} \\
                        &= \frac{3x^2h + 3xh^2 + h^3}{h} \\
                        &= \frac{h(3x^2 + 3xh + h^2)}{h} \\
                        &= 3x^2 + 3xh + h^2.
\end{align*}

(Work out what happens to the value of the difference quotient as
h gets closer and closer to zero.)

The second term in the final expression above contains the factor h,
and the third term is h^2, so as h gets closer and closer to zero, both of
these terms get closer and closer to zero. So the value of the whole
expression gets closer and closer to the value of the first term, 3x^2.
That is, the formula for the derivative of the function f(x) = x^3 is

\[ f'(x) = 3x^2. \]


The formula for the derivative of f(x) = x^3 found in Example 1 tells you
that, for example, the gradient of the graph of y = x^3 at the point with
x-coordinate 1 is

\[ f'(1) = 3 \times 1^2 = 3, \]

and the gradient of this graph at the point with x-coordinate 1/2 is

\[ f'\left(\frac{1}{2}\right) = 3 \times \left(\frac{1}{2}\right)^2 = \frac{3}{4}. \]


You can see from Figure 18 that the gradients of the tangents to the graph
at these two points do seem to be the numbers calculated using the
formula.


Figure 18 Two tangents to the graph of f(x) = x^3


In the next activity, you’re asked to use differentiation from first principles
to find the derivative of the function f(x) = x^4.


Activity 6 Differentiating from first principles


(a) Multiply out the expression (x + h)^4. To do this, start by writing
(x + h)^4 = (x + h)(x + h)^3. Then replace the expression (x + h)^3 by
the expansion of (x + h)^3 that was found in the solution to Example 1,
remembering to enclose it in brackets. Finally, multiply out the
brackets and collect like terms.


(b) Hence differentiate from first principles the function f(x) = x^4.


(c) What is the gradient of the graph of the function f(x) = x^4 at the
point with x-coordinate +? 


In each of the three examples of differentiation from first principles that 
you’ve seen so far, the function was differentiable at every value of x. If a 
function isn’t differentiable at some values of x, then you can still 
differentiate it from first principles, but the process won’t work for the 
values of x at which it’s not differentiable. For these values of x the value 
of the difference quotient won’t get closer and closer to a particular 
number as h gets closer and closer to zero (where h can be either positive
or negative). 


For example, if the graph of a function f has a sharp corner at (x, f(x)), 
then the difference quotient for f at x will get closer and closer to a 
particular value when h is positive, but will get closer and closer to a
different value when h is negative. As another example, if the graph of the 
function has a vertical tangent at (x, f(x)), then the magnitude of the 
difference quotient for f at x will just keep getting larger and larger, 
without getting closer and closer to any particular value. 
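
The two situations just described can be explored numerically. The following Python sketch is an illustration only: the helper names `difference_quotient` and `cube_root` are assumptions made for this example, and the functions |x| (a sharp corner at the origin) and the cube root of x (a vertical tangent at the origin) are chosen here as typical cases rather than taken from the text.

```python
def difference_quotient(f, x, h):
    """Gradient of the line through (x, f(x)) and (x + h, f(x + h)); h must not be zero."""
    return (f(x + h) - f(x)) / h


def cube_root(x):
    """Real cube root, defined for negative x as well as positive x."""
    return x ** (1 / 3) if x >= 0 else -((-x) ** (1 / 3))


for h in (0.1, 0.001, 0.00001):
    from_right = difference_quotient(abs, 0.0, h)    # sharp corner, positive h
    from_left = difference_quotient(abs, 0.0, -h)    # sharp corner, negative h
    steep = difference_quotient(cube_root, 0.0, h)   # vertical tangent
    print(h, from_right, from_left, steep)
```

For |x| the quotient stays at 1 for positive h and at -1 for negative h, so there is no single limiting value. For the cube root the quotient keeps growing as h shrinks.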


In general, saying that a function f is differentiable at a particular value 
of x is the same as saying that the difference quotient for f at x tends to a 
limit as h tends to zero. It must tend to the same limit for positive values
of h as for negative values. 


The method of differentiation from first principles is summarised in the
box below. The notation ‘\lim_{h \to 0}’ means ‘the limit as h tends to zero of’.


Differentiation from first principles

For any function f, the derivative f’ of f is given by the equation

\[ f'(x) = \lim_{h \to 0} \frac{f(x+h) - f(x)}{h}, \]

for each value of x in the domain of f for which this limit exists.
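
To get a feel for this limit, you can evaluate the difference quotient for smaller and smaller values of h. The Python sketch below does this for f(x) = x^3 at x = 1. The function names are assumptions made for this illustration, and a numerical experiment of this kind only suggests the value of the limit; it doesn't prove it.

```python
def difference_quotient(f, x, h):
    """Gradient of the line through (x, f(x)) and (x + h, f(x + h))."""
    if h == 0:
        raise ValueError("h must be non-zero")
    return (f(x + h) - f(x)) / h


def cube(x):
    return x ** 3


# Let h shrink towards zero through both positive and negative values.
for h in (0.1, -0.1, 0.001, -0.001, 0.00001, -0.00001):
    print(h, difference_quotient(cube, 1.0, h))
```

The printed values should settle towards 3, in agreement with the formula f'(x) = 3x^2 evaluated at x = 1.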


Differentiation from first principles can be used to find formulas for the 
derivatives of many of the functions that you'll need to work with. 
However, it’s a laborious process, so usually we don’t do it! Instead, in this 
unit and in the first half of Unit 7 you’ll get to know the formulas for the 
derivatives of a range of standard functions, such as f(x) = x^2, f(x) = x^3,
f(x) = sin x, f(x) = e^x, and so on. You'll also learn about ways in which
you can combine these formulas to obtain formulas for the derivatives of
other, related functions. For example, if you know the formulas for the
derivatives of f(x) = x^2 and f(x) = x^3, then you can combine them to
obtain a formula for the derivative of f(x) = x^2 + x^3. In these ways you'll
be able to find formulas for the derivatives of most of the functions that 
you'll need to work with. 


Of course, the idea of differentiation from first principles is still needed, to 
find the derivatives of the standard functions, to check that the rules for 
combining them are valid, and to differentiate functions that aren’t 
standard functions or combinations of standard functions. 


Formulas for derivatives are powerful mathematical tools in many different 
situations, in both pure and applied mathematics. In this module, not only 
will you learn how to find such formulas, but you'll also be introduced to a 
few of the ways in which they can be used. You’ll see many more uses of 
them if you go on to study further modules in mathematics, or some 
modules in areas such as science and economics. 


The invention of calculus 


The history of calculus goes back to the second half of the seventeenth 
century, when Isaac Newton in England and Gottfried Wilhelm 
Leibniz in what is now Germany both independently developed the 
basic ideas. Newton’s ideas were rooted in the applications of 
mathematics, while Leibniz’s were rooted in pure mathematics. 


Newton developed the ideas of calculus starting in about 1665. He 
called his ideas the ‘method of fluxions’ and wrote a treatise about 
them in 1671, which was not published in his lifetime, although its 
contents circulated in manuscript form, and a publication containing 
the method appeared in 1704. Leibniz then independently developed 
similar ideas, starting in about 1674. A manuscript that he wrote in 
1675 includes the notation used in integral calculus to this day, as 
well as a standard rule for combining derivatives, the product rule, 
which you'll meet in Unit 7. Leibniz’s notation for differential 
calculus is also still used today, as you'll see shortly. Leibniz first
published work on calculus in 1684.


Isaac Newton (1642-1727)


The two men continued to develop their ideas for the next few years. 
However, by the early 1700s Leibniz was being accused by Newton’s
associates of having plagiarised Newton’s work. The allegation was 
that Leibniz had seen some of Newton’s unpublished papers and had 
merely invented a new notation for Newton’s ideas. The ensuing 
bitter argument led to a Royal Society investigation, which upheld 
the charge. However, the investigation was largely carried out by 
Newton’s friends, and Newton, who was President of the Royal 
Society, secretly guided its report. Investigations by modern 
historians have shown that the accusation against Leibniz was unjust. 
Newton and Leibniz arrived at equivalent results following different 
paths of discovery. 


Gottfried Wilhelm Leibniz 
(1646-1716) 


Isaac Newton 


Isaac Newton was born in Lincolnshire, and studied and worked at 
the University of Cambridge. He was one of the world’s greatest 
physicists, mathematicians and astronomers, and is remembered in 
particular for his work on classical mechanics. He did much of his 
initial work on calculus at his family home in Lincolnshire, while 
Cambridge University was closed due to an outbreak of plague. In his 
later life Newton largely abandoned physics and mathematics, and 
wrote theological tracts before becoming Master of the Mint, a 
highly-paid government official, in London. He also worked on 
alchemy throughout his life. He was knighted in 1705, but for 
political reasons rather than for his scientific work or public service. 


Gottfried Wilhelm Leibniz


Gottfried Wilhelm Leibniz was born in Leipzig, and attended 
university there and in Altdorf. He was a universal thinker who 
graduated in philosophy and law, and was self-taught in mathematics. 
He went on to work intensively on mathematics in Paris, before 
accepting the position of Counsellor and librarian at the court in 
Hanover, where he remained for the rest of his life. While there, he 
worked on many different projects, making important contributions to 
mathematics, philosophy, theology and history. Some of Leibniz’s 
projects were related to his salaried position, but his employers also 
allowed him to work on other projects of his choosing. He was 
interested in formalising calculations, and constructed the first 
mechanical calculator that could add, subtract, multiply and divide. 


Before we begin the process of building up a collection of formulas for the 
derivatives of standard functions, and techniques for combining them, it’s 
useful for you to learn an alternative notation for derivatives. 


Leibniz notation 


The notation that we’ve been using so far, in which the derivative of a 
function f is denoted by f’, is called Lagrange notation or prime 
notation. (‘Lagrange’ is pronounced as a French word: ‘La-grawnge’.) It 
was invented by Joseph-Louis Lagrange, about a century after calculus was 
discovered. The term ‘prime notation’ arises from the fact that the 
symbol ’ in the notation f’ is often called ‘prime’. 


However, there’s another notation, called Leibniz notation, invented by 
Gottfried Wilhelm Leibniz. (Remember that ‘Leibniz’ is pronounced as a 
German word: ‘Libe-nits’.) You’ll need to become familiar with both 
notations, as they’re both used throughout this module, and throughout 
mathematics generally. 


Each type of notation has different advantages in different situations. 
Generally, Lagrange notation is used when you’re thinking in terms of a 
variable, and a function of this variable. On the other hand, Leibniz 
notation is often used when you’re thinking more of the relationship 
between two variables. The distinction will become clearer as you become 
used to working with the two notations. 


Joseph-Louis Lagrange 


Joseph-Louis Lagrange was an Italian-French mathematician who 
made important contributions in many areas, including calculus, 
mechanics, astronomy, probability and number theory. He was 
appointed as a professor of mathematics at the Royal Artillery School 
in Turin at the age of only 19, and was a dedicated and prolific 
mathematician for the rest of his life, working mainly in Turin and 
Berlin. 


Joseph-Louis Lagrange (1736-1813)


To see how derivatives are written in Leibniz notation, consider the
equation y = x^2, which expresses a relationship between the variables x
and y. You’ve seen that the formula for the gradient of the graph of the
equation y = x^2 is

gradient = 2x.

In Leibniz notation this equation is written as

\[ \frac{\mathrm{d}y}{\mathrm{d}x} = 2x. \]

So the notation dy/dx means the same as f'(x), where y = f(x). It’s read as
‘d y by d x’. When Leibniz notation is being used, dy/dx is often referred to
as the derivative of y with respect to x.


If you want to write the notation

\[ \frac{\mathrm{d}y}{\mathrm{d}x} \]

in a line of text, then you can write it
as dy/dx, just as you would do for a fraction. However, although the
notation looks like a fraction, it’s important to remember that it isn’t a
fraction!


Also, you should be aware that the ‘d’ that’s part of Leibniz notation has
no meaning outside of it. In particular, although dy and dx look like d × y
and d × x, respectively, the ‘d’ is certainly not a factor and must not be
cancelled! In many mathematical texts, including this one, the ‘d’ in
Leibniz notation appears in upright type, rather than the italic type used
for variables, to emphasise this fact. (You don’t need to do anything
special when you handwrite Leibniz notation; you should just write the
‘d’ in the normal way.)


To understand the thinking behind Leibniz notation, consider Figure 19. 
It’s exactly the same as Figure 17 on page 225, which illustrates 
differentiation from first principles, except that some things are labelled 
differently. For example, the point at which we want to find the gradient is 
labelled (x, y) instead of (x, f(x)). This is because the emphasis here is on 
the relationship between the variables x and y, rather than on the idea of 
f as a function of x. Another difference is that the change in the 
x-coordinate from the point at which we want to find the gradient to the
second point is denoted by δx instead of h, so the x-coordinate of the
second point is written as x + δx instead of x + h.


The symbol δ is the lower-case Greek letter delta, and is read as ‘delta’.
By convention, when the symbol δ is used as a prefix it indicates ‘a small
change in’, so δx denotes a small change in x. The change in the
y-coordinate from the point at which we want to find the gradient to the
second point is denoted in a similar way, as δy, which means that the
y-coordinate of the second point is y + δy.


Figure 19 The points (x, y) and (x + δx, y + δy) on the graph of
y = f(x), and the line through them


The gradient of the line that passes through the two points is then

\[ \frac{\text{rise}}{\text{run}} = \frac{\delta y}{\delta x}. \]

As the second point gets closer and closer to the first point, the value of
δy/δx gets closer and closer to the gradient of the curve at the point (x, y).
In other words, the gradient is the limit of δy/δx as the second point gets
closer and closer to the first point (and so as δx and δy get smaller and
smaller). This is why the notation dy/dx, which looks similar to δy/δx,
was chosen to represent the gradient.


The expression δy/δx can be used instead of expression (2) on page 225 to
carry out differentiation from first principles, and you might see this done
in some other texts on calculus. The process is exactly the same, just with
h replaced by δx, but it can look a bit more complicated at first sight.


Leibniz notation can be used in a variety of ways. For example, the symbol

\[ \frac{\mathrm{d}}{\mathrm{d}x} \]

means ‘the derivative with respect to x of’. So, for example, a concise way
to express the fact that the gradient of the graph of the equation y = x^2 is
given by the formula 2x is to write

\[ \frac{\mathrm{d}}{\mathrm{d}x}(x^2) = 2x. \]

As with Lagrange notation, Leibniz notation can be used with variable
names other than the standard ones, x and y. For example,

\[ \text{if } s = t^2, \text{ then } \frac{\mathrm{d}s}{\mathrm{d}t} = 2t, \]

and

\[ \text{if } p = q^3, \text{ then (by Example 1) } \frac{\mathrm{d}p}{\mathrm{d}q} = 3q^2. \]

Similarly (by Activity 6),

\[ \frac{\mathrm{d}}{\mathrm{d}w}(w^4) = 4w^3. \]

Sometimes, particularly on a computer algebra system, you might see
dy/dx written as

\[ \frac{\mathrm{d}}{\mathrm{d}x}\, y. \]


Usually, if a function is specified using function notation, then you use 
Lagrange notation for its derivative, whereas if it’s specified using an 
equation that expresses one variable in terms of another, then you use 
Leibniz notation. For example, if you know that f(x) = x^2, then you write
f'(x) = 2x, whereas if you know that y = x^2, then you write dy/dx = 2x.
However, there are no absolute rules about this, and in fact it’s often 
helpful to mix the two notations. In particular, it’s often convenient to use 
the symbol d/dx, even when you’re mostly using Lagrange notation, as 
you'll see. 


Lagrange notation and Leibniz notation are the two most common 
notations for derivatives, but there are other useful notations, including 
one invented by Isaac Newton. You’ll meet some of these notations if you 
go on to study calculus beyond this module, particularly in the area of 
applied mathematics. 


Functions whose domains include endpoints


It’s often useful to find the derivative of a function whose domain includes
an endpoint. For example, the function f(x) = x^(3/2), whose graph is shown
in Figure 20, has domain [0, ∞), which includes 0 as an endpoint.


Any function whose domain includes an endpoint isn’t differentiable at this 
endpoint. For example, consider the function f(x) = x^(3/2). The point on its
graph that corresponds to the endpoint 0 of its domain is the origin. You 
can trace your pen tip along the graph to the origin from the right, but 
you can’t do the same from the left, so the graph doesn’t have a tangent at 
the origin. Hence it doesn’t have a gradient at the origin; that is, it’s not 
differentiable at x = 0. 


However, the graph of f(x) = x^(3/2) does have a ‘tangent on the right’ at
the origin, in the sense that you can trace your pen tip along the graph 
towards the origin from the right, and continue moving it in the direction 
in which it’s been moving when it reaches the origin. It will then move 
along a straight line, namely the x-axis, in this case. So the graph has a 
‘gradient on the right’ at the origin, namely 0, the gradient of the x-axis. 


Because of this, we say that the function f(x) = x^(3/2) is
right-differentiable at x = 0, and that its right derivative at 0 is 0.
Similarly, a function can be left-differentiable at a particular x-value,
and it will then have a left derivative at that x-value.


Figure 20 The graph of f(x) = x^(3/2)


The left or right derivative of a function at a particular x-value can be 
found by using differentiation from first principles in much the same way as 
the usual, two-sided derivatives. The only difference is that instead of the 
increase in the x-coordinates, h, taking either positive or negative values as 
it gets closer and closer to zero, for a right-sided derivative h takes just 
positive values, and for a left-sided derivative h takes just negative values. 


Saying that a function is differentiable at a particular x-value is the same 
as saying that it has both a left and a right derivative at that x-value, and
the left and right derivatives are equal. 


If f is a function whose domain includes one or more endpoints, then we 
adjust the definition of its derivative f’ slightly to allow for these, as 
follows. We include in the domain of f’ not just the values of x at which f 
is differentiable, but also the values of x that are endpoints of the domain 
of f and at which f is left- or right-differentiable. The value of f’ at each 
of these endpoints is the appropriate left or right derivative. 


All the results about derivatives that you’ll meet in this module apply, 
with the appropriate adjustments, to left and right derivatives as well as to 
the usual, two-sided derivatives. For simplicity, this isn’t stated explicitly 
for each individual result. For example, in the next subsection you'll learn 
how to find a formula for the derivative of the function f(x) = x^(3/2). This
formula is valid for all values of x in the domain [0, ∞) of this function f,
but the value that it gives for x = 0 is the right derivative of f at 0, rather 
than the derivative of f at 0, which doesn’t exist. 
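
A right derivative can be estimated numerically by letting h take only positive values. The Python sketch below does this at x = 0 for f(x) = x^(3/2), and also for f(x) = x^(1/2), which is included here as a contrasting case. The helper name `right_quotients` is an assumption made for this illustration.

```python
def right_quotients(f, x, hs):
    """Difference quotients for f at x, using positive values of h only."""
    if any(h <= 0 for h in hs):
        raise ValueError("every h must be positive for a right derivative")
    return [(f(x + h) - f(x)) / h for h in hs]


def three_halves(x):
    return x ** 1.5   # x^(3/2), defined only for x >= 0


def square_root(x):
    return x ** 0.5   # x^(1/2), defined only for x >= 0


positive_hs = [0.01, 0.0001, 0.000001]
print(right_quotients(three_halves, 0.0, positive_hs))  # values shrink towards 0
print(right_quotients(square_root, 0.0, positive_hs))   # values keep growing
```

The first list is consistent with a right derivative of 0 at the origin. The second list grows without settling down, which is what you'd expect for a function that isn't right-differentiable at 0.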
Summary of important ideas


To finish this section, here’s a summary of some of the important ideas 
that you’ve met. 


Derivatives 


The derivative (or derived function) of a function f is the
function f’ such that

f'(x) = gradient of the graph of f at the point (x, f(x)).


The domain of f’ consists of the values in the domain of f at which f
is differentiable (that is, the x-values that give points at which the
gradient exists).


If y = f(x), then f'(x) is also denoted by dy/dx.


The derivative f’ is given by the equation

\[ f'(x) = \lim_{h \to 0} \frac{f(x+h) - f(x)}{h}. \]


The procedure of using this equation to find a formula for the 
derivative f’ is called differentiation from first principles. 


2 Finding derivatives of simple 
functions 


In this section you'll meet the formulas for the derivatives of standard 
functions of a particular type, and you’ll also meet two rules for combining 
formulas for derivatives. 


2.1 Derivatives of power functions 


In the previous section you saw formulas for the derivatives of the functions

f(x) = x^2, f(x) = x^3 and f(x) = x^4.

Any function of the form

f(x) = x^n,

where n is a real number, is called a power function. In this subsection
you'll see how to find the formula for the derivative of any power function,
without having to use differentiation from first principles.
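

The formulas that you saw for the derivatives of the three power functions
above can be stated as follows, using Leibniz notation:

\[ \frac{\mathrm{d}}{\mathrm{d}x}(x^2) = 2x, \qquad \frac{\mathrm{d}}{\mathrm{d}x}(x^3) = 3x^2, \qquad \frac{\mathrm{d}}{\mathrm{d}x}(x^4) = 4x^3. \]

Notice that they all follow the same pattern. In each case, to obtain the
derivative, you multiply by the power, then reduce the power by 1, as
shown below.

\[
\begin{array}{ccccc}
    & \text{multiply by the power} &      & \text{reduce the power by 1} &      \\
x^2 & \longrightarrow              & 2x^2 & \longrightarrow              & 2x   \\
x^3 & \longrightarrow              & 3x^3 & \longrightarrow              & 3x^2 \\
x^4 & \longrightarrow              & 4x^4 & \longrightarrow              & 4x^3
\end{array}
\]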


It turns out that the derivative of every power function follows the same 
pattern. This fact can be stated algebraically as follows. 


Derivative of a power function 


For any number n,

\[ \frac{\mathrm{d}}{\mathrm{d}x}(x^n) = n x^{n-1}. \]


You can confirm this formula for any particular value of n that’s a positive 
integer by differentiating from first principles. You saw this process carried 
out for the particular power functions f(x) = x^2, f(x) = x^3 and f(x) = x^4
in the last subsection. In general, for the power function f(x) = x^n, the
process involves multiplying out the expression (x + h)^n. (In Unit 10 you'll
meet a quick way to multiply out expressions like this, namely by using the 
binomial theorem.) 


However, the formula holds not just for values of n that are positive 
integers, but for all values of n, including negative and fractional values. 
You'll see a proof of this fact in Unit 7.


As you’d expect, for any particular value of n, the formula holds for all 
values of x for which the function f(x) = x^n is differentiable. For example,
if n = 2, then the formula holds for all values of x. On the other hand, if
n = 1/2, then the formula holds only for positive values of x. The function
f(x) = x^(1/2), that is, f(x) = √x, whose graph is shown in Figure 21, isn’t
defined for negative values of x, and isn’t differentiable (or even 
right-differentiable) at x = 0. 
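
The formula for the derivative of a power function can be compared with difference quotients for a positive integer, a negative and a fractional value of n. The Python sketch below does this at positive values of x, where all three functions are differentiable. The function names, the sample points and the choice h = 0.0000001 are assumptions made for this illustration, and agreement at a few points is a check, not a proof.

```python
def power_rule(n, x):
    """Value given by the formula n * x^(n - 1)."""
    return n * x ** (n - 1)


def difference_quotient(f, x, h):
    """Gradient of the line through (x, f(x)) and (x + h, f(x + h))."""
    return (f(x + h) - f(x)) / h


def power(n):
    """Return the power function f(x) = x^n."""
    return lambda x: x ** n


h = 1e-7  # a small, non-zero value of h
for n, x in [(2, 3.0), (-1, 2.0), (0.5, 4.0)]:
    estimate = difference_quotient(power(n), x, h)
    print(n, x, estimate, power_rule(n, x))
```

In each row the estimate from the difference quotient should be close to the value given by the formula.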


Figure 21 The graph of f(x) = √x


More generally, whenever you see a formula for the derivative of a
function, such as

\[ \frac{\mathrm{d}}{\mathrm{d}x}(x^2) = 2x \quad \text{or} \quad f'(x) = 2x, \]

you can assume that the formula holds for all values of the variable
(usually x) for which the function is differentiable, unless it’s stated
otherwise. You can follow the same convention when you write such
formulas yourself. Remember in particular that for a function to be
differentiable at a particular value of x, it must also be defined there. For
example, the function f(x) = √x isn’t differentiable at x = -3, since it
isn’t even defined there.


The next example shows you how to use the general formula for the 
derivative of a power function. In parts (b) and (c), the expression on the 
right-hand side of the rule of the function isn’t given in the form x”, but 
you can use some of the index laws that you met in Unit 1 to rewrite it in 
this form. The two index laws that are the most useful for this sort of 
situation are repeated in the box below. 


Two index laws

\[ a^{-n} = \frac{1}{a^n}, \qquad a^{1/n} = \sqrt[n]{a} \]


For example, using these laws you can write 1/x^3 as x^(-3), and ∛x as x^(1/3).


In the solution to the example, Lagrange notation is used where the 
function is specified using function notation, and Leibniz notation is used 
where it is specified by an equation expressing one variable in terms of 
another. You should do likewise in the activity that follows the example. 


Figure 22 The graph of y = √x, and its tangent at
the point with x-coordinate 4


As illustrated in Example 2, whenever you find the derivative of a
function, you should simplify your answer, if possible. As is often the case 
with algebraic simplifications, there may be no ‘right answer’ for the 
simplest form, as any of several different possibilities might do. The 
derivatives of the functions in Example 2 and the next activity contain 
indices — you saw some general guidelines for simplifying expressions that 
contain indices in Subsection 4.3 of Unit 1. 


Activity 7 Differentiating power functions 


Differentiate the following functions. 


(a) f@)=e (b) f@=2 © f@=yQ @ fe)= 73/2 
(ce) f@)== (f) f(@@)=29? — &) f@)=2 (hy) f(a) = = 
Ossa Mya" Wu Wye 

(m) y=? m) f@)=2? ©) f@)=aq Wro 


Activity 8 Finding the gradient at a point on the graph of a power 
function 


Use the solution to Example 2(c) to find the gradient of the graph of 
y = √x at the point with x-coordinate 4. (The tangent to the graph at
this point is shown in Figure 22.) 


Notice that the formula for the derivative of a power function tells you 
that the function 


f(x) = x (which is the same as f(x) = x^1)

has derivative

f'(x) = 1 × x^0; that is, f'(x) = 1.


This is as you’d expect, because the graph of the function f(x) = x (shown 
in Figure 23(a)) is a straight line with gradient 1, which means that the 
gradient at every point on the graph is 1. 


Another simple power function is the constant function

f(x) = 1 (which is the same as f(x) = x^0).


Its graph (shown in Figure 23(b)) is a straight line with gradient 0, so the
gradient at every point on its graph is 0. So its derivative is simply

f'(x) = 0.


Figure 23 The graphs of (a) f(x) = x (b) f(x) = 1


Note that to make the general formula work for the functions f(x) = x and
f(x) = 1 when x = 0, we have to assume that 0^0 = 1. The value of 0^0 was
discussed in Unit 1.


2.2 Constant multiple rule 


In this subsection and the next, you’ll see two ways in which you can use 
formulas that you know for the derivatives of functions to find formulas for 
the derivatives of other, related functions. 


First, suppose that you know the formula for the derivative of a particular 
function, and you want to know the formula for the derivative of a 
constant multiple of the function. For example, you already know the 
formula for the derivative of the function f(x) = x^2, but suppose that you
want to know the formula for the derivative of the function g(x) = 3x^2.
Let’s think about how the formula for the derivative of the second function 
can be worked out from the formula for the derivative of the first function. 


When you multiply a function by a constant, the effect on its graph is
that, for each x-value, the corresponding y-value is multiplied by the
constant. So the graph is stretched or squashed vertically, and, if the
constant is negative, then it’s also reflected in the x-axis. As you saw in
Unit 3, these effects are called vertical scalings. For example, Figure 24
shows the graphs of y = x^2, y = 3x^2, y = (1/2)x^2 and y = -x^2, and the point
with x-coordinate 1 on each of these graphs.


You can see the following effects. 


• Multiplying the function f(x) = x^2 by the constant 3 scales its graph
vertically by a factor of 3 (which stretches it).


• Multiplying the function f(x) = x^2 by the constant 1/2 scales its graph
vertically by a factor of 1/2 (which squashes it).


• Multiplying the function f(x) = x^2 by the constant -1 scales its graph
vertically by a factor of -1 (which reflects it in the x-axis).


Figure 24 The graphs of (a) y = x^2 (b) y = 3x^2 (c) y = (1/2)x^2 (d) y = -x^2


The stretching or squashing, and possible reflection, of the graph causes 
the gradient at each x-value to change. For example, you can see that, at
the point with x-coordinate 1, the graph of y = 3x^2 is steeper than the
graph of y = x^2.


To see exactly how the gradients change, first consider what happens to 
the gradient of a straight line when you scale it vertically by a particular 
factor, say a. The scaled line will go up by a times as many units for every 
one unit that it goes along, compared to the unscaled line. In other words, 
its gradient is multiplied by the factor a. For example, Figure 25(a) 
illustrates what happens when you take a straight line with gradient 1 and 
scale it vertically by a factor of 3. 


The same thing happens for any graph: if you scale it vertically by a 
particular factor, then its gradient at any particular x-value is multiplied 
by this factor. For example, Figure 25(b) illustrates that if you take a 
curve that has gradient 1 at a particular x-value, and scale it vertically by 
a factor of 3, then the new curve has gradient 3 at that x-value. 


Figure 25 (a) A straight line with gradient 1, and the result of scaling it
vertically by a factor of 3 (b) a curve that has gradient 1 at a particular
x-value, and the result of scaling it vertically by a factor of 3


So, if you multiply a function by a constant, then its derivative is 
multiplied by the same constant. This fact can be stated as in the box 
below. 


Constant multiple rule (Lagrange notation) 


If the function k is given by k(x) = af(x), where f is a function and a 
is a constant, then 


k'(x) = af'(x),


for all values of x at which f is differentiable. 


For example, since the derivative of f(x) = x? is f'(x) = 2x, it follows by 
the constant multiple rule that 

• the derivative of g(x) = 3x^2 is g'(x) = 3 × 2x = 6x

• the derivative of h(x) = (1/2)x^2 is h'(x) = (1/2) × 2x = x

• the derivative of p(x) = -x^2 is p'(x) = -2x.

The third of these follows because taking a negative is the same as
multiplying by -1. It’s useful to remember in general that if f and k are
functions such that k(x) = -f(x), then, by the constant multiple rule,

k'(x) = -f'(x),

for all values of x at which f is differentiable.


Like everything involving derivatives, the constant multiple rule can also 
be stated in Leibniz notation, as follows. 


Constant multiple rule (Leibniz notation)


If y = au, where u is a function of x and a is a constant, then

\[ \frac{\mathrm{d}y}{\mathrm{d}x} = a \frac{\mathrm{d}u}{\mathrm{d}x}, \]

for all values of x at which u is differentiable.


(The phrase ‘u is differentiable’ in the box above is a condensed way of 
saying that if we write u = f(x) then f is differentiable at x.) 
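
The constant multiple rule can be checked numerically before you see its proof. The Python sketch below compares the difference quotient for k(x) = af(x) with a times the difference quotient for f, taking f(x) = x^2 at x = 1 and the constants 3, 1/2 and -1. The helper names and the choice h = 0.000001 are assumptions made for this illustration.

```python
def difference_quotient(f, x, h):
    """Gradient of the line through (x, f(x)) and (x + h, f(x + h))."""
    return (f(x + h) - f(x)) / h


def square(x):
    return x ** 2


def scaled(a, f):
    """Return the function k given by k(x) = a * f(x)."""
    return lambda x: a * f(x)


h = 1e-6  # a small, non-zero value of h
for a in (3, 0.5, -1):
    k = scaled(a, square)
    print(a, difference_quotient(k, 1.0, h), a * difference_quotient(square, 1.0, h))
```

For each constant the two printed estimates should agree closely, and both should be near a × 2, the value of af'(1) when f(x) = x^2.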


The constant multiple rule can be proved formally by using the idea of 
differentiation from first principles, and you’ll see this done at the end of 
this subsection. First, however, you should concentrate on learning to use 
it. Here’s an example. 


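Example 3 Using the constant multiple rule

Differentiate the following functions.

(a) f(x) = 8x^4 (b) f(x) = -√x (c) y = 3/x


Solution

(a) (The derivative is 8 times the derivative of x^4.)

\[ f'(x) = 8 \times 4x^3 = 32x^3 \]

(b) (The derivative is the negative of the derivative of √x.)

Since f(x) = -x^(1/2),

\[ f'(x) = -\frac{1}{2} x^{-1/2} = -\frac{1}{2} \times \frac{1}{x^{1/2}} = -\frac{1}{2\sqrt{x}}. \]

(c) (The derivative is 3 times the derivative of 1/x.)

Since y = 3x^(-1),

\[ \frac{\mathrm{d}y}{\mathrm{d}x} = 3 \times (-1)x^{-2} = -\frac{3}{x^2}. \]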


Here are some examples for you to try. Notice that in some of them the 
letters used are not the standard ones, x, y and f. 


Activity 9 Using the constant multiple rule


Differentiate the following functions. 


(a) f(x) =5e (b) fe)=-27 (©) Fle) =2VF 

() f@)=62 (©) fe? OFO=F e gu) = -Tu 
Ws- Os- Ov- Wes 

Q usá (m) p= (a) a= -iari 


Activity 10 Using the constant multiple rule to find a gradient 


Find the gradient of the graph of the function f(x) = 3x^2 at the point with
x-coordinate 2.


You saw in the previous subsection that the function 


f(x) = 1
has derivative
f'(x) = 0.


This fact, together with the constant multiple rule, tells you that if a is 
any constant, then the function 


f(x) =a (which is the same as f(x) =a x 1) 
has derivative 
f'(x) = a × 0 = 0.
For example, the function f(x) = 3 has derivative f'(x) = 0. 


This is as you would expect, because the graph of the function f(x) = a 
(which is illustrated in Figure 26, in the case where a is positive) is a 
straight line with gradient 0, which means that the gradient at every point 
on the graph is 0. 


This fact about the derivative of a constant function can be stated as 
follows. 


Derivative of a constant function 


If a is a constant, then 


d/dx (a) = 0.


Figure 26 The graph of f(x) = a
To finish this subsection, here’s a formal proof of the constant multiple
rule, using differentiation from first principles. It uses the Lagrange 
notation form of the constant multiple rule, which is repeated below. 


Constant multiple rule (Lagrange notation) 


If the function k is given by k(x) = af(x), where f is a function and a 
is a constant, then 


k'(x) = af'(x),


for all values of x at which f is differentiable. 


A proof of the constant multiple rule 

Suppose that f is a function and a is a constant. Consider the function k 
given by k(x) = af(x). Let x be any value at which f is differentiable. To 
find k'(x), you have to consider what happens to the difference quotient
for k at x, which is

(k(x + h) − k(x)) / h

(where h can be positive or negative but not zero), as h gets closer and
closer to zero. Since k(x) = af(x), the difference quotient for k at x is
equal to

(af(x + h) − af(x)) / h,

which is equal to

a((f(x + h) − f(x)) / h).

The expression in the large brackets is the difference quotient for f at x, so,
as h gets closer and closer to zero, it gets closer and closer to f'(x). Hence
the whole expression gets closer and closer to af'(x). In other words,

k'(x) = af'(x),

which is the constant multiple rule.


2.3 Sum rule 


Now suppose that you know the formulas for the derivatives of two 
functions, and you want to know the formula for the derivative of their 
sum. For example, you already know the formulas for the derivatives of the 
functions f(x) = x^2 and g(x) = x^3, but suppose that you want to know the
formula for the derivative of the function k(x) = x^2 + x^3.
When you add two functions, the y-coordinates of the points on the two
graphs are added. For example, Figure 27 shows the graphs of f(x) = x^2,
g(x) = x^3 and k(x) = x^2 + x^3, and the point with x-coordinate 1 on each
of the three graphs.


[Figure 27 panels: (a) y = x^2, with the point (1, 1) marked; (b) y = x^3, with the point (1, 1) marked; (c) y = x^2 + x^3, with the point (1, 2) marked]


Figure 27 The graphs of two functions, and the graph of the sum of 
these functions 


The effect that adding the functions has on the gradients is quite 
complicated to think about, but in fact it’s just what you might expect: 
if you add two functions, then the gradient of the sum function at any 
particular x-value is the sum of the gradients of the two original functions 
at that x-value. 


For example, Figure 28 shows the tangents at the three points marked in 
Figure 27. You can see that they have gradients 2, 3 and 5 (the sum of 2 
and 3), respectively. 


[Figure 28 panels: (a) y = x^2, (b) y = x^3, (c) y = x^2 + x^3]


Figure 28 Tangents to the graphs in Figure 27 
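
The gradients 2, 3 and 5 read from the tangents can also be checked numerically. The Python sketch below is an illustration only, not part of the module text: the helper difference_quotient and the particular values of h are assumptions made here. It computes difference quotients at x = 1 for x^2, x^3 and their sum, for smaller and smaller h.

```python
def difference_quotient(func, x, h):
    """Difference quotient of func at x for a non-zero step h."""
    return (func(x + h) - func(x)) / h


def f(x):
    return x ** 2


def g(x):
    return x ** 3


def k(x):
    # k is the sum function f + g
    return f(x) + g(x)


for h in (0.1, 0.01, 0.001):
    qf = difference_quotient(f, 1, h)
    qg = difference_quotient(g, 1, h)
    qk = difference_quotient(k, 1, h)
    # For each h, the quotient for k is the sum of the other two quotients.
    print(f"h = {h}: {qf:.4f} + {qg:.4f} = {qk:.4f}")
```

As h shrinks, the three columns approach 2, 3 and 5 respectively, which matches the gradients of the tangents in Figure 28.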
The general rule can be stated as follows.


Sum rule (Lagrange notation) 

If k(x) = f(x) + g(x), where f and g are functions, then 
k'(x) = f'(x) + g'(x), 

for all values of x at which both f and g are differentiable. 


For example, since the derivative of f(x) = x^2 is f'(x) = 2x, and the
derivative of g(x) = x^3 is g'(x) = 3x^2, it follows from the sum rule that the
derivative of k(x) = x^2 + x^3 is

k'(x) = 2x + 3x^2.

The sum rule extends to sums of larger numbers of functions. For example,
if k(x) = f(x) + g(x) + h(x), then

k'(x) = f'(x) + g'(x) + h'(x),

for all values of x at which all of f, g and h are differentiable.

It also follows from the sum rule, together with the constant multiple rule,
that if k(x) = f(x) − g(x), then

k'(x) = f'(x) − g'(x),

for all values of x at which both f and g are differentiable. You can see
this by writing the equation k(x) = f(x) − g(x) as

k(x) = f(x) + (−g(x)).

Then, since the derivative of −g(x) is −g'(x) by the constant multiple rule,
it follows by the sum rule that

k'(x) = f'(x) + (−g'(x))
      = f'(x) − g'(x),

as claimed. Again, this fact extends to sums and differences of larger
numbers of functions. For example, if k(x) = f(x) + g(x) − h(x), then

k'(x) = f'(x) + g'(x) − h'(x),

for all values of x at which all of f, g and h are differentiable.


Here is the sum rule expressed in Leibniz notation. 


Sum rule (Leibniz notation) 


If y = u + v, where u and v are functions of x, then

dy/dx = du/dx + dv/dx,

for all values of x at which both u and v are differentiable.




Like the constant multiple rule, the sum rule can be formally proved by 
using the idea of differentiation from first principles, and you'll see this 
done at the end of this subsection. First, however, you need to practise 
using it. 


The next example shows you how to do this. In particular, it illustrates 
that you can sometimes differentiate a function that isn’t expressed as a 
sum of functions by first rearranging its formula so that it is. It also 
illustrates how you can use the sum rule together with the constant 
multiple rule. 


Example 4 Using the sum rule
Differentiate the following functions.

(a) f(x) = x + 2∛x    (b) f(x) = (x + 2)(x − 5)

(c) y = (2x − 1)/x    (d) y = 5(x^2 − 2x + 1)


Solution

(a) (By the sum rule, you can differentiate each term individually
and add the results.)

f(x) = x + 2x^(1/3), so

f'(x) = 1 + 2 × (1/3)x^(−2/3)
      = 1 + 2/(3x^(2/3)).


(b) (First write the function as a sum of functions, by multiplying
out.)
f(x) = (x + 2)(x − 5) = x^2 − 3x − 10, so
(Differentiate each term individually.)
f'(x) = 2x − 3 − 0
      = 2x − 3.
(c) (First write the function as a sum of functions, by expanding
the fraction.)

y = (2x − 1)/x = 2x/x − 1/x = 2 − x^(−1), so

(Differentiate each term individually. Simplify your answer.)

dy/dx = 0 − (−1)x^(−2)
      = 1/x^2.
Activity 11 Using the sum rule


Differentiate the following functions. 

(a) f(w) =60?-2e+1 — (b) f(z) = 3a +20? +2- } 
(c) fe)=5e+1 Wasan (©) fle) = (1+. 22)(1 +32) 
(O Ru) = (u+3)?—(@) fle) = 30a a) 
wio- Oa 


t 
O= W y= 4 e +52) 


In this section you’ve seen how to differentiate any power function, and 
you’ve met the constant multiple rule and the sum rule. So, in particular, 
you can now differentiate any polynomial function, that is, any function of 
the form 


f(x) = a sum of terms, each of the form ax^n,
where a is a number and n is a non-negative integer. 


Of course, you can also differentiate many other functions too, such as 
some functions involving negative or fractional powers of x. 
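
The recipe for differentiating a polynomial is mechanical enough to write as a few lines of Python. The sketch below is an illustration rather than part of the module: the function name differentiate_polynomial and the choice to store a polynomial as a list of coefficients [a0, a1, a2, ...] are assumptions made here.

```python
def differentiate_polynomial(coefficients):
    """Differentiate a polynomial given as [a0, a1, a2, ...],
    meaning a0 + a1*x + a2*x^2 + ...

    Each term a*x^n becomes (a*n)*x^(n-1): the rule for power functions
    gives the derivative of x^n, the constant multiple rule deals with a,
    and the sum rule lets the terms be treated one at a time.
    The constant term has derivative 0, so it is dropped.
    """
    return [a * n for n, a in enumerate(coefficients)][1:]


# x^2 - 3x - 10, the expanded form of (x + 2)(x - 5)
print(differentiate_polynomial([-10, -3, 1]))  # coefficients of -3 + 2x

# x^2 + x^3
print(differentiate_polynomial([0, 0, 1, 1]))  # coefficients of 2x + 3x^2
```

With this representation a constant function such as [3] gives the empty list, which stands for the zero function.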


You’ve seen that every function of the form f(x) = x^n, where n is a
positive integer, is differentiable at every value of x, and that so is every 
constant function. These facts, together with the constant multiple rule 
and the sum rule, give the following useful fact. 




Every polynomial function (with domain R) is differentiable at every 
value of x. 


To finish this subsection, here’s a proof of the sum rule, using 
differentiation from first principles. It uses the Lagrange notation form of 
the sum rule, which is repeated below. 


Sum rule (Lagrange notation) 
If k(x) = f(x) + g(x), where f and g are functions, then 
k'(x) = f'(x) + g'(x), 


for all values of x at which both f and g are differentiable. 


A proof of the sum rule 
Suppose that f and g are functions, and that the function k is given by 
k(x) = f(x) + g(x). Let x denote any value at which both f and g are 
differentiable. To find k’(x), you have to consider what happens to the 
difference quotient for k at x, which is 

(k(x + h) − k(x)) / h

(where h can be either positive or negative, but not zero), as h gets closer
and closer to zero. Since k(x) = f(x) + g(x), this expression is equal to

((f(x + h) + g(x + h)) − (f(x) + g(x))) / h,

that is,

(f(x + h) + g(x + h) − f(x) − g(x)) / h,

which is equal to

((f(x + h) − f(x)) / h) + ((g(x + h) − g(x)) / h).

The expression in the first pair of large brackets is the difference quotient
for f at x, and the expression in the second pair of large brackets is the
difference quotient for g at x. So as h gets closer and closer to zero, the
values of these two expressions get closer and closer to f'(x) and g'(x),
respectively. Hence the whole expression gets closer and closer to
f'(x) + g'(x). So

k'(x) = f'(x) + g'(x),


which is the sum rule. 
Figure 29 The lines
(a) y = 2x − 2
(b) y = −3x + 1


Figure 30 The tangent to
the graph of y = x^2 at the
point with x-coordinate 1


3 Rates of change


You saw in Unit 2 that the gradient of a straight-line graph is the rate of 
change of the variable on the vertical axis with respect to the variable on 
the horizontal axis. For example, if the relationship between the variables 
x and y is represented by a straight line with gradient 2, then y is 
increasing at the rate of 2 units for every unit that x increases, as 
illustrated in Figure 29(a). Similarly, if the relationship between x and y is 
represented by a straight line with gradient −3, then y is decreasing at the
rate of 3 units for every unit that x increases, as illustrated in Figure 29(b). 


As mentioned in Subsection 1.1, the idea of a gradient as a rate of change
also applies to curved graphs. For example, you’ve seen that the graph of
the equation y = x^2 has gradient 2 at the point with x-coordinate 1. This
means that, when x = 1, the variable y is increasing at the rate of 2 units
for every unit that x increases, as illustrated in Figure 30.


Of course, unlike in Figure 29(a), in Figure 30 the variable y doesn’t
actually increase by two units for every unit that x increases. This rate of
change is an ‘instantaneous’ value, valid only for the x-value 1. For other
values of x, the rate of change of y with respect to x (the gradient of the
graph) is different.


Since the gradient of any graph of the variable y against the variable x is
given by the derivative dy/dx, another way to think about the derivative
dy/dx is that it is the rate of change of y with respect to x.


As you’ve seen, it’s particularly helpful to think of a gradient as a rate of
change when you’re working with a graph that models a real-life situation.
In this section you’ll look at two types of real-life situation involving rates
of change.


3.1 Displacement and velocity 


Remember from Subsection 2.4 of Unit 2 that when you model the motion 
of an object along a straight line, you choose some point on the line to be 
the reference point, and one of the two directions along the line to be the 
positive direction. Then the object’s displacement at any particular time 
is its distance from the reference point, with a plus or minus sign to 
indicate its direction from this point. For example, in Figure 31 an object
at position A has a displacement of 12 m, and an object at position B has
a displacement of −7 m.


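[Figure 31: a horizontal line with the positive direction to the right; B is 7 m from the reference point on the negative side, and A is 12 m from it on the positive side]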


Figure 31 Positions along a straight line 


You saw that if the displacement of an object along a straight line is 
plotted against time, then the gradient of the resulting graph is the rate of 
change of the object’s displacement with respect to time, which is called its 
velocity. (Remember that, in the calculus units in this module, 
displacement and velocity are always one-dimensional, and hence are 
represented by scalars, with direction indicated by the signs of the scalars.) 


For example, Figure 32 shows the two displacement–time graphs for a man
walking along a straight path that you saw in Subsection 1.1. The 
reference point on the straight path has been chosen to be the point where 
the man begins his walk, and the positive direction has been chosen to be 
the direction in which he walks. Time is measured from the moment when 
the man begins his walk. 


[Figure 32 panels (a) and (b): displacement, s (km), against time, t (hours)]


Figure 32 Two displacement–time graphs for a man walking along a
straight path


In each of the two graphs in Figure 32, the quantity on the horizontal axis
(time) is measured in hours, and the quantity on the vertical axis
(displacement) is measured in kilometres, so the gradient is measured in
kilometres per hour. The graph in Figure 32(a) is a straight line with
gradient 6 km h^(−1), which means that the man is walking at a constant
velocity of 6 km h^(−1). In the graph in Figure 32(b), the man seems to start
off walking at a velocity of about 6 km h^(−1), but his velocity continuously
decreases as he walks.


In general, since the gradient of the displacement–time graph of any object
that moves along a straight line is the velocity of the object, we have the 
following important fact. 


Relationship between displacement and velocity


Suppose that an object is moving along a straight line. If t is the time 
that has elapsed since some chosen point in time, s is the 
displacement of the object from some chosen reference point, and v is 
the velocity of the object, then 


v = ds/dt.

(Time, displacement and velocity can be measured in any suitable 
units, as long as they’re consistent.) 


An example of a set of consistent units is seconds for time, metres for 
displacement, and metres per second for velocity. These units are 
consistent because the units for time and displacement are the same as the 
units for time and displacement within the derived units for velocity. 


Notice that displacement is denoted by the letter s in the box above and in 
Figure 32. This is the usual choice of letter for this quantity. The more 
natural choice of d is usually avoided, because it might cause confusion 
with the d used in Leibniz notation. Velocity is usually denoted by the 
letter v, and time by the letter t, as you’d expect. 


One explanation of the fact that displacement (or distance) is usually 
denoted by the letter s is that it is the initial letter of the Latin word 
for distance, which is ‘spatium’. 


The fact in the box headed ‘Relationship between displacement and 
velocity’ above tells you that if you have a formula for the displacement of 
an object moving along a straight line, in terms of time, then you can use 
differentiation to obtain a formula for the object’s velocity in terms of time. 


This fact is used in the next example. Here the equation of the
displacement–time graph in Figure 32(b), which is

s = 6t − (5/8)t^2,

is used to obtain an equation for the man’s velocity in terms of time.


Example 5 Using differentiation to find a velocity 


Suppose that a man walks along a straight path, and his displacement 
s (in kilometres) at time t (in hours) after he begins his walk is given 
by 


s = 6t − (5/8)t^2.

Let the man’s velocity at time t be v (in kilometres per hour).
(a) Find an equation expressing v in terms of t.
(b) Hence find the man’s velocity one hour into his walk.

Solution

(a) The man’s displacement s at time t is given by

s = 6t − (5/8)t^2.

Hence his velocity v at time t is given by

v = ds/dt
  = 6 − (5/8) × 2t
  = 6 − (5/4)t.

That is, the required equation is v = 6 − (5/4)t.
(b) When t = 1,

v = 6 − (5/4) × 1 = 4.75.

So the man’s velocity after one hour is 4.75 km h^(−1).


Figure 33 shows the tangent to the graph in Figure 32(b) at the point one
hour after the man begins his walk. You can see that the gradient of the
tangent – that is, the man’s velocity – does indeed seem to be roughly
4.75 km h^(−1), as calculated in the example above.


Notice that in Example 5 we started off knowing how the man’s
displacement changed with time – the relationship between these two
quantities is given by the equation s = 6t − (5/8)t^2. We used this information
to find how the man’s velocity (his rate of change of displacement) changed
with time – we found that this relationship is given by the equation
v = 6 − (5/4)t. This is the sort of calculation that differential calculus allows
you to carry out: if you know the values taken by a changing quantity
throughout a period of change, then you can use differentiation to find the
values taken by the rate of change of the quantity throughout the same
period.


[Figure 33 axes: displacement, s (km), against time, t (hours)]


Figure 33 The tangent to
the graph of s = 6t − (5/8)t^2 at
the point with t-coordinate 1
A falling ball, photographed
at equal time intervals


You might find it enlightening to think about what the process of
differentiation from first principles means when it is applied to a
displacement–time graph. In this case the difference quotient at a
particular time value represents the average velocity over a time interval 
starting or ending at that time value. As the time interval gets smaller and 
smaller, the average velocity gets closer and closer to the instantaneous 
velocity at that time value. 
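
To see this happening with numbers, here is a small Python sketch. It is an illustration only: it assumes the displacement formula from Example 5, read here as s = 6t − (5/8)t^2 (the reading that gives the velocity 4.75 km h^(−1) at t = 1), and the function names and interval widths are choices made for this sketch.

```python
def displacement(t):
    """Displacement s (km) at time t (hours): s = 6t - (5/8)t^2."""
    return 6 * t - (5 / 8) * t ** 2


def average_velocity(t_start, t_end):
    """Average velocity (km per hour) over the interval t_start to t_end."""
    return (displacement(t_end) - displacement(t_start)) / (t_end - t_start)


# Average velocities over shorter and shorter intervals starting at t = 1.
for width in (1, 0.1, 0.01, 0.001):
    print(width, average_velocity(1, 1 + width))
```

The printed average velocities move steadily towards 4.75 as the interval shrinks, which is the instantaneous velocity found in Example 5.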


The next activity is about a particular type of motion in a straight line: 
that of an object falling vertically from rest, near the surface of the Earth. 
The phrase ‘from rest’ here means that the object has no initial velocity 
when it begins falling. If the effect of air resistance is negligible (which it is 
if the object is fairly compact and the fall is fairly short), then the distance 
fallen by the object isn’t affected by how heavy it is, for example, but 
depends only on the time that it has been falling. In fact, the total 
distance fallen is proportional to the square of the time that it has been 
falling. If time is measured in seconds and distance is measured in metres, 
then the constant of proportionality is about 4.9, so at time t (in seconds)
after the object began falling it will have fallen a total distance of about
4.9t^2 (in metres).


When we model the motion of a falling object using displacement rather 
than distance, we often take the positive direction along the line of motion 
to be upwards. So, if we also take the reference point to be the point from 
which the object starts to fall, then the displacement s (in metres) of the 
object at time t (in seconds) after it starts to fall is given by the equation


s = −4.9t^2.


The graph of this equation is shown in Figure 34. 


[Figure 34 axes: displacement, s (m), against time, t (seconds); the curve is s = −4.9t^2]


Figure 34 The displacement–time graph for an object falling from rest


In the next activity, remember that, just as distance is the magnitude of
displacement, so speed is the magnitude of velocity. So, for example, an
object with a velocity of −3 m s^(−1) has a speed of 3 m s^(−1).
Activity 12 Finding the velocity of a falling object


(a) You saw above that the displacement s (in metres) of an object falling
from rest, at time t (in seconds) since it began falling, is given by the
equation s = −4.9t^2. Use this equation to find an equation for the
velocity v (in m s^(−1)) of the object at time t.


(b) Hence find the velocity of such an object three seconds into its fall.
Give your answer in metres per second, to one decimal place.


(c) If an object is dropped from the top of a tower, how long does it take
to reach a speed of 15 m s^(−1)? Give your answer in seconds, to one
decimal place.


The fact that the distance travelled by a falling object is proportional 
to the square of the time that it has been falling was determined by 
the Italian physicist Galileo Galilei in the sixteenth century, using 
experiments that he carried out in his workshop. At the time, there 
were no clocks that could measure intervals of time short enough to 
allow him to determine the motion of a falling object, but instead he 
rolled balls down a sloping groove, and reasoned that their motion 
would be similar to that of a falling object, but slower.

Galileo Galilei (1564–1642)


3.2 Total cost and marginal cost


Here’s another example of a situation where it’s useful to think of the
gradient of a graph (and hence the derivative of a function) as a rate of
change. This example shows you one way in which the idea of
differentiation can be used in economics.


Suppose that a small confectionery company has found that the weekly 
cost of making its milk chocolate consists of a fixed cost of £3000 (to pay 
for the rent and maintenance of its premises and equipment, for example), 
plus £8 per kilogram of chocolate made (to pay for the ingredients and 
staff time, for example). In other words, the weekly cost, c (in £), is 
modelled by the equation 


c = 3000 + 8q,    (3)


where q is the amount of chocolate made in the week, in kilograms. The 
graph of this equation is shown in Figure 35. 
On average, each person in
the UK consumes about 10 kg
of chocolate per year


[Figure 35 axes: total cost, c (£), against quantity, q (kg)]


Figure 35 The graph of the equation c = 3000 + 8q, a model for the total 
weekly cost c (in £) of making milk chocolate in terms of the quantity q 
(in kg) of chocolate made 


Suppose that the company is currently making a particular quantity of 
milk chocolate per week, but is thinking of increasing its production. Then 
the additional weekly cost of making the extra chocolate will, of course, be 
£8 per extra kilogram made. This is known as the marginal cost per 
kilogram of making the extra chocolate. It’s different from the real cost 
per kilogram of making the chocolate (usually called the unit cost or 
average cost), which is equal to the total cost, 3000 + 8q (in £), divided 
by the quantity in kilograms of chocolate made, q. 


In fact, the marginal cost of making the extra chocolate is the gradient of 
the graph in Figure 35. This is because the gradient of the graph is the 
rate at which the quantity on the vertical axis (the total cost, in £) 
increases as the quantity on the horizontal axis (the quantity of chocolate 
made, in kg) increases. The fact that the gradient is 8 pounds per 
kilogram, which is normally written as £8 per kilogram, tells you that the 
total cost is increasing at a rate of £8 for every kilogram by which the 
quantity of chocolate increases. 
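
The difference between unit cost and marginal cost for the model c = 3000 + 8q can be seen by computing both for a few quantities. The Python sketch below is an illustration only: the function names and the sample quantities 100, 500 and 1000 kg are choices made here, not values from the module.

```python
def total_cost(q):
    """Total weekly cost c (in pounds) of making q kg: c = 3000 + 8q."""
    return 3000 + 8 * q


def unit_cost(q):
    """Average cost per kilogram when q kg are made (q must be positive)."""
    return total_cost(q) / q


def marginal_cost(q, extra=1):
    """Additional cost per extra kilogram when production rises above q."""
    return (total_cost(q + extra) - total_cost(q)) / extra


for q in (100, 500, 1000):
    print(q, unit_cost(q), marginal_cost(q))
```

The unit cost falls as q grows, because the fixed cost of £3000 is spread over more chocolate, whereas the marginal cost stays at £8 per kilogram, the gradient of the straight-line graph.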


Often an equation that models the total cost of making a quantity of a 
product doesn’t have a straight-line graph. For example, suppose that the 
confectionery company has found that a better model for the total weekly 
cost c (in £) of making a quantity q (in kg) of milk chocolate is 

c = 3000 + 4q + iE: 


The graph of this equation is shown in Figure 36. 


[Figure 36 axes: total cost, c (£), against quantity, q (kg), from 0 to 800 kg]


Figure 36 The graph of the equation c = 3000 + 4q + TE: an 
alternative model for the total weekly cost c (in £) of making milk 
chocolate in terms of the quantity q (in kg) of chocolate made 


As before, the gradient of the graph tells you the rate at which the total 
weekly cost increases as the amount of chocolate made increases. In other 
words, the gradient of the graph is the marginal cost per kilogram of 
making extra chocolate. You can see that the gradient of the graph — the 
marginal cost per kilogram — increases as the quantity of chocolate made 
increases. This tells you that the more chocolate the company makes, the 
larger is the cost of making an extra kilogram of chocolate. 


There are various reasons why this might be the case. For example, to 
increase its production of chocolate the company might have to pay its 
staff at higher rates, for overtime or non-standard hours. 


For any particular quantity q (in kg) of chocolate being made, the 
marginal cost per kilogram of making any extra chocolate is the gradient 
of the graph at that value of q, which as you know is given by dc/dq. Of
course, this is an ‘instantaneous’ value, valid only for that value of q. 


Activity 13 Working with marginal cost


Suppose that the confectionery company discussed above has decided that 
the second equation above, namely 


c = 3000 + 44 + aga’, 


is an appropriate model for the total weekly cost of making its milk 
chocolate. Here q is the quantity of chocolate made (in kg), and c is the 
total cost (in £). Let the marginal cost per kilogram of making any extra 
chocolate be m (in £). 


(a) Use differentiation to find an equation for the marginal cost m in 
terms of the quantity q. 


(b) What is the marginal cost per kilogram of making extra chocolate 
when the amount of chocolate already being made is 300 kilograms? 


(c) The company sells all the chocolate that it makes at a price of £16 per 
kilogram. It decides to keep increasing its weekly production of 
chocolate until the marginal cost is equal to the price at which it sells 
the chocolate. (This is because if it increases its production any 
further, then it will cost more money to make the extra chocolate than 
the company will obtain by selling it.) By writing down and solving a 
suitable equation, find the weekly quantity of chocolate that the 
company should make. 


The fact that marginal cost is the derivative of total cost is used in many 
economic models. 


4 Finding where functions are 
increasing, decreasing or stationary 


Because the derivative of a function tells you the gradient at each point on 
the graph of the function, it gives you information about the shape of the 
graph. In this section, you’ll see how you can use the derivative of a 
function to deduce useful facts about its graph. 




4.1 Increasing/decreasing criterion 


You saw the following definitions in Unit 3. 


Functions increasing or decreasing on an interval 


A function f is increasing on the interval I if for all values x1 and
x2 in I such that x1 < x2,

f(x1) < f(x2).

A function f is decreasing on the interval I if for all values x1 and
x2 in I such that x1 < x2,

f(x1) > f(x2).

(The interval I must be part of the domain of f.)


Informally, a function is increasing on an interval if its graph slopes up on
that interval, and is decreasing on an interval if its graph slopes down on
the interval. For example, the function f(x) = 2x^3 − 3x^2 − 36x is
increasing on the interval (−∞, −2), decreasing on the interval (−2, 3),
and increasing on the interval (3, ∞), as illustrated in Figure 37.


Figure 37 The graph of the function f(x) = 2x^3 − 3x^2 − 36x


Of course, you can’t tell from the graph alone that this function stops
increasing and starts decreasing when x is exactly −2, or that it stops
decreasing and starts increasing when x is exactly 3. You’ll see shortly how
to confirm that the function is increasing and decreasing on exactly the
intervals mentioned above.


You could also say that the function in Figure 37 is increasing on the
interval (−∞, −2], decreasing on the interval [−2, 3], and increasing on the
interval [3, ∞). However, when we discuss intervals on which a function is
increasing or decreasing, it’s often helpful to consider intervals that don’t
overlap, so we usually consider open intervals.
Figure 38 Some tangents to
the graph of
f(x) = 2x^3 − 3x^2 − 36x


Since positive gradients correspond to a graph sloping up, while negative
gradients correspond to a graph sloping down, the derivative of a function 
tells you the intervals on which the function is increasing or decreasing, as 
set out below. 


Increasing/decreasing criterion 
If f'(x) is positive for all x in an interval I, then f is increasing on I.


If f'(x) is negative for all x in an interval I, then f is decreasing on I.


For example, Figure 38 shows parts of the tangents to some points on the 
graph of the function f(x) = 2x^3 − 3x^2 − 36x. It illustrates that positive
gradients correspond to intervals on which the graph is increasing, and 
negative gradients correspond to intervals on which the graph is decreasing. 


It’s sometimes useful to determine whether a particular function that 
you’re working with is increasing on a particular interval, or decreasing on 
a particular interval. (For example, this can help you sketch its graph.) 
You can often use the increasing/decreasing criterion to do this, as 
illustrated in the next example. This example confirms that the function in 
Figure 37 is increasing and decreasing on the intervals mentioned earlier. 


Example 6 Using the increasing/decreasing criterion 


Show that the function f(x) = 2x^3 − 3x^2 − 36x is increasing on each
of the intervals (−∞, −2) and (3, ∞), and decreasing on the interval
(−2, 3).
Solution
(Find the derivative.)
The derivative is

f'(x) = 6x^2 − 6x − 36.


(Show that the derivative is always positive when x is in the
interval (−∞, −2) or in the interval (3, ∞), and always negative when
x is in the interval (−2, 3). Try factorising to help you do this.)


Factorising gives

f'(x) = 6(x^2 − x − 6) = 6(x + 2)(x − 3).

When x is less than −2, the values of x + 2 and x − 3 are both
negative, and hence the value of f'(x) = 6(x + 2)(x − 3) is positive.
Similarly, when x is greater than 3, the values of x + 2 and x − 3 are
both positive, and hence the value of f'(x) = 6(x + 2)(x − 3) is also
positive. Finally, when x is between −2 and 3, the value of x + 2 is
positive and the value of x − 3 is negative, and hence the value of f'(x)
is negative. So, by the increasing/decreasing criterion, f is increasing
on each of the intervals (−∞, −2) and (3, ∞), and decreasing on the
interval (−2, 3).
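
The sign argument in Example 6 can be spot-checked with a few sample values. The Python sketch below is an illustration only: the function names and the sample x-values are assumptions made here, and checking samples doesn’t prove anything about a whole interval, which is why the factorisation argument is still needed.

```python
def f_prime(x):
    """Derivative of f(x) = 2x^3 - 3x^2 - 36x, in factorised form."""
    return 6 * (x + 2) * (x - 3)


def sign(value):
    """Return '+', '-' or '0' according to the sign of value."""
    if value > 0:
        return "+"
    if value < 0:
        return "-"
    return "0"


# Sample x-values (chosen for illustration) in each of the three intervals.
samples = {
    "(-inf, -2)": [-10, -3, -2.5],
    "(-2, 3)": [-1, 0, 2.9],
    "(3, inf)": [3.1, 4, 10],
}

for interval, xs in samples.items():
    print(interval, [sign(f_prime(x)) for x in xs])
```

The signs printed for each interval agree with the example: positive, negative, positive.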




Activity 14 Using the increasing/decreasing criterion 


Consider the function f(x) = (2/3)x^3 − 8x^2 + 30x − 36.

(a) Find the derivative f'(x), and factorise it.

(b) Show that f is increasing on each of the intervals (−∞, 3) and (5, ∞).
(c) Show that f is decreasing on the interval (3, 5).


An alternative way to set out working of the type in Example 6 and in the 
solution to Activity 14 is to use a table of signs, in the way that you saw in 
Section 5.4 of Unit 3. You’ll see examples of this in the next subsection. 


Activity 15 Using the increasing/decreasing criterion again 


Consider the function f(x) = x^3 − 3x^2 + 4x + 3.


(a) Find the derivative f'(x), and complete the square on this expression.
(You saw how to complete the square in Unit 2.)


(b) Hence show that f is increasing on the interval (−∞, ∞) (that is, on
the whole of its domain).


If you look back to the statement of the increasing/decreasing criterion
earlier in this subsection, you’ll see that the first part begins

If f'(x) is positive for all x in an interval I ....

A slightly more concise way to express the same thing is to say

If f' is positive on an interval I ....

Similarly, the beginning of the second part of the increasing/decreasing
criterion,

If f'(x) is negative for all x in an interval I ...,

can be expressed as

If f' is negative on an interval I ....


These more concise forms are used in the next subsection. 
4.2 Stationary points

Consider again the function

f(x) = 2x^3 − 3x^2 − 36x,


which was discussed in the previous subsection. Its graph is shown again in 
Figure 39. As you saw earlier, the gradient of the graph is positive on the 
interval (−∞, −2), negative on the interval (−2, 3) and positive again on
the interval (3, ∞).


Between these open intervals, there are points at which the gradient of the 
graph is zero, as shown in Figure 39. A point at which the gradient of a 
graph is zero is called a stationary point. So the graph in Figure 39 has
stationary points at the points with x-coordinates −2 and 3. If you work
out the corresponding y-coordinates (by substituting the x-coordinates
into the rule of f in the usual way), then you find that the stationary
points are (−2, 44) and (3, −81).


Figure 39 The stationary points on the graph of f(x) = 2x^3 − 3x^2 − 36x


The term ‘stationary point’ is used to refer to the x-coordinate of a 
stationary point, as well as to the point itself. This is because real 
numbers are sometimes called points — you can think of them as points on 
the number line. (You’ve already seen that a real number at one end of an 
interval is known as an endpoint.) So, for example, we can say that the 
stationary points of the function f(x) = 2x^3 − 3x^2 − 36x are −2 and 3.


At the left-hand stationary point in Figure 39, the value taken by the 
function is larger than at any other point nearby, so we say that the 
function has a local maximum at this point. Similarly, at the right-hand 
stationary point, the value taken by the function is smaller than at any 
other point nearby, so we say that the function has a local minimum at 
this point. 


Notice that the value taken by a function at a local maximum or local 
minimum isn’t necessarily the greatest or least value that the function 
takes overall. For example, you can see from Figure 39 that the function 
f(x) = 2x^3 − 3x^2 − 36x takes a larger value when x = 6, say, than it does
at its local maximum, at x = −2. This is why the word ‘local’ is used.




A point where a function has a local maximum or local minimum is called 
a turning point, because if you imagine tracing your pen tip along the 
graph of the function from left to right, then at a local maximum or 
minimum it stops going up or down, and turns to go the other way. 


Notice that at a local maximum the graph of a function is increasing on 
the left and decreasing on the right, and at a local minimum it is 
decreasing on the left and increasing on the right. 


There are other types of stationary points, apart from turning points. For 
example, Figure 40(a) shows the graph of the function f(x) = x^3. The
derivative of this function is f'(x) = 3x^2, so the gradient of the graph when
x = 0 is

f'(0) = 3 × 0^2 = 0.

So the function f(x) = x^3 has a stationary point at x = 0. However, this
stationary point isn’t a turning point. The graph of the function f(x) = x^3
is increasing on both sides of the stationary point, and just levels off
momentarily at the stationary point itself.


Similarly, the graph of the function g(x) = −x^3 is decreasing on both sides
of its stationary point, as illustrated in Figure 40(b). 


Figure 40 The graphs of (a) f(x) = x^3 (b) g(x) = −x^3

A stationary point of either of the types shown in Figure 40 is called a
horizontal point of inflection. You'll see the reason for this term later
in the unit. Notice that the tangent to a curve at a horizontal point of
inflection crosses the curve at that point.

Some stationary points are neither turning points nor horizontal points of
inflection. For example, every point on the graph of the equation y = 1
(see Figure 41), or on any horizontal line, is a stationary point that is
neither a turning point nor a horizontal point of inflection.


When you’re working with a function, it’s sometimes useful to find its
stationary points, and, if possible, determine the nature of each stationary
point; that is, determine whether it is a local maximum, a local minimum
or a horizontal point of inflection. In the rest of this subsection you’ll learn
how to do this, and in the next subsection you'll see some examples of why
it can be a useful thing to do.

Figure 41 The graph of y = 1


You’ve seen that the stationary points of a function are the values of x at 
which the gradient of its graph is zero. So the method for finding the 
stationary points of a function is as follows. 


Strategy: 
To find the stationary points of a function f 


Solve the equation f'(x) =0. 


In the next example, this strategy is used to confirm that the function 
f(x) = 2x^3 − 3x^2 − 36x,


which we considered at the beginning of this subsection and in the last 
subsection, has stationary points at x = —2 and x = 3. 


Example 7 Finding the stationary points of a function
Find the stationary points of the function

f(x) = 2x^3 − 3x^2 − 36x.

Solution
Find the derivative f'(x). Factorise it if possible.
The derivative is
f'(x) = 6x^2 − 6x − 36
= 6(x^2 − x − 6)
= 6(x + 2)(x − 3).
Solve the equation f'(x) = 0.
Solving the equation f'(x) = 0 gives
6(x + 2)(x − 3) = 0;
that is,
x = −2 or x = 3.

Hence the stationary points are −2 and 3.
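
The calculation in Example 7 can be checked numerically. The following is a minimal Python sketch, not part of the module materials: the function name stationary_points_of_cubic is made up for this illustration, and it assumes the function is a cubic, so that f'(x) is a quadratic that can be solved with the quadratic formula.

```python
import math


def stationary_points_of_cubic(a, b, c):
    """Return the real solutions of f'(x) = 0 for f(x) = a*x^3 + b*x^2 + c*x + d.

    Here f'(x) = 3a*x^2 + 2b*x + c, and a is assumed to be non-zero.
    (Illustrative helper; the name is not taken from the text.)
    """
    qa, qb, qc = 3 * a, 2 * b, c      # coefficients of the quadratic f'(x)
    disc = qb * qb - 4 * qa * qc      # discriminant of f'(x)
    if disc < 0:
        return []                     # f'(x) = 0 has no real solutions
    root = math.sqrt(disc)
    # A set removes the duplicate when the discriminant is zero.
    return sorted({(-qb - root) / (2 * qa), (-qb + root) / (2 * qa)})


def f(x):
    """The function from Example 7."""
    return 2 * x**3 - 3 * x**2 - 36 * x


points = stationary_points_of_cubic(2, -3, -36)
print(points)                    # x-coordinates of the stationary points
print([f(x) for x in points])    # corresponding y-coordinates
```

The printed x-values should agree with the stationary points −2 and 3 found in Example 7, and the y-values with the points (−2, 44) and (3, −81) quoted at the start of this subsection.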
Activity 16 Finding the stationary points of a function


Find the approximate values, to two decimal places, of the stationary 
points of the function 


f(x) = x^3 − x^2 − 2x.


To determine whether a stationary point of a function f is a local 
maximum, local minimum or horizontal point of inflection, you can use the 
following facts. If there’s an open interval immediately to the left of the 
stationary point, and an open interval immediately to the right of the 
stationary point, such that 


• f is increasing on the left interval and decreasing on the right interval,
then the stationary point is a local maximum


• f is decreasing on the left interval and increasing on the right interval,
then the stationary point is a local minimum


• f is increasing on both intervals or decreasing on both intervals, then
the stationary point is a horizontal point of inflection.


For example, Figure 42 shows the graph of a function with a local 
maximum, and open intervals immediately to the left and right of the local 
maximum. The function is increasing on the left interval, and decreasing 
on the right interval. 


Combining the facts above with the increasing/decreasing criterion (stated 
on page 260) gives the following useful test for determining the nature of a 
stationary point. It’s known as the first derivative test — the reason for the 
word ‘first’ is explained in Section 5. 


First derivative test (for determining the nature of a 
stationary point of a function f) 


If there are open intervals immediately to the left and right of a 
stationary point such that 


• f'(x) is positive on the left interval and negative on the right
interval, then the stationary point is a local maximum

• f'(x) is negative on the left interval and positive on the right
interval, then the stationary point is a local minimum

• f'(x) is positive on both intervals or negative on both intervals,
then the stationary point is a horizontal point of inflection.


This test is illustrated in Figure 43. The signs show whether the derivative
is positive or negative on intervals immediately to the left and right of the
stationary point.


Figure 42 Open intervals immediately to the left and right of a local maximum


Figure 43 The signs of the derivative to the left and right of different types of
stationary point
Here’s an example showing how you can use the first derivative test to
determine the nature of the stationary points of a function. The example 
uses the technique of drawing up a table of signs, which you met in Unit 3. 


Example 8 Determining the nature of stationary points using the
first derivative test
Consider the function

f(x) = (4/3)x^3 + 5x^2 − 6x − 2.

(a) Find the stationary points of f.

(b) Determine whether each stationary point of f is a local
maximum, local minimum or horizontal point of inflection.


Solution
(a) Find f'(x), and factorise it, if possible.
The derivative is
f'(x) = 4x^2 + 10x − 6
= 2(2x^2 + 5x − 3)
= 2(2x − 1)(x + 3).
Solve the equation f'(x) = 0.
Solving the equation f'(x) = 0 gives
2(2x − 1)(x + 3) = 0;
that is,
x = 1/2 or x = −3.
Hence the stationary points of f are −3 and 1/2.


(b) Construct a table of signs to help you find intervals on which
f'(x) is negative or positive. In the column headings, write the
values where f'(x) = 0 (that is, the stationary points of f), in
increasing order, and also the intervals to the left and right of,
and between, these values. In the row headings, write the factors
of f'(x), and then f'(x) itself. Find the signs in the way that you
learned in Subsection 5.4 of Unit 3.
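
A table of signs of the kind described above can also be generated mechanically. The Python sketch below is only an illustration under stated assumptions: it takes one sample value inside each interval (the values −4, 0 and 1 are choices made here, not taken from the text), and it relies on each factor of f'(x) being linear, so that its sign is the same throughout each interval. All names in the code are made up for this sketch.

```python
from fractions import Fraction


def sign(value):
    """Return '+', '-' or '0' according to the sign of value."""
    return "+" if value > 0 else "-" if value < 0 else "0"


# Factors of f'(x) = 2(2x - 1)(x + 3), as found in part (a) of Example 8.
factors = {
    "2x - 1": lambda x: 2 * x - 1,
    "x + 3": lambda x: x + 3,
}


def f_prime(x):
    return 2 * (2 * x - 1) * (x + 3)


# Column headings: the stationary points and the intervals around them,
# each paired with one value of x used to find the signs in that column.
columns = [
    ("(-inf, -3)", Fraction(-4)),
    ("-3", Fraction(-3)),
    ("(-3, 1/2)", Fraction(0)),
    ("1/2", Fraction(1, 2)),
    ("(1/2, inf)", Fraction(1)),
]

rows = dict(factors)
rows["f'(x)"] = f_prime
table = {name: [sign(g(x)) for _, x in columns] for name, g in rows.items()}

print("columns:", [heading for heading, _ in columns])
for name, signs in table.items():
    print(f"{name:>7}: " + "  ".join(signs))
```

Reading the last row from left to right, f'(x) changes from positive to negative at −3 and from negative to positive at 1/2, which is the information that the first derivative test needs.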




Activity 17 Determining the nature of stationary points using the 
first derivative test 


Consider the function 
f(x) = 3x^4 − 2x^3 − 9x^2 + 7.
(a) Find the stationary points of f.
Hint: to factorise f'(x), first notice that it has x as a factor.


(b) Determine whether each stationary point of f is a local maximum, 
local minimum or horizontal point of inflection. 


There’s an alternative way to apply the first derivative test, which you can 
use instead of constructing a table of signs, if you prefer. It’s described in 
the following box. The different possibilities in step 3 are illustrated in 
Figure 44. 
Strategy:
To apply the first derivative test by choosing sample points 


1. Choose two points (that is, two x-values) fairly close to the 
stationary point, one on each side. 


2. Check that the function is differentiable at all points between the 
chosen points and the stationary point, and that there are no 
other stationary points between the chosen points and the 
stationary point. 


3. Find the value of the derivative of the function at the two chosen 
points. 


• If the derivative is positive at the left chosen point and
negative at the right chosen point, then the stationary point
is a local maximum.


• If the derivative is negative at the left chosen point and
positive at the right chosen point, then the stationary point
is a local minimum.


• If the derivative is positive at both chosen points or negative
at both chosen points, then the stationary point is a
horizontal point of inflection.


Figure 44 How to recognise, by considering points on each side of a
stationary point, whether the stationary point is (a) a local maximum
(b) a local minimum or (c), (d) a horizontal point of inflection


When you use this alternative way of applying the first derivative test, it 
doesn’t matter how far the two points that you choose are from the 
stationary point, as long as the conditions in step 2 are satisfied. The 
reason why these conditions are important is explained after the next 
example. This example repeats Example 8, but uses the alternative way of 
applying the first derivative test. 
Example 9 Determining the nature of stationary points using the
alternative way of applying the first derivative test
Consider the function

f(x) = (4/3)x^3 + 5x^2 − 6x − 2.

(a) Find the stationary points of f.


(b) Determine whether each stationary point of f is a local
maximum, local minimum or horizontal point of inflection.


Solution
(a) This part is the same as in Example 8.
The derivative is
f'(x) = 4x^2 + 10x − 6
= 2(2x^2 + 5x − 3)
= 2(2x − 1)(x + 3).
Solving the equation f'(x) = 0 gives
2(2x − 1)(x + 3) = 0;
that is,
x = 1/2 or x = −3.
Hence the stationary points of f are −3 and 1/2.


(b) Apply the method in the box above. There are many possible
choices of points in step 1; choose them to be as simple as
possible.


Consider the values −4, 0 and 1. The values −4 and 0 lie on each
side of the stationary point −3, and the values 0 and 1 lie on each
side of the stationary point 1/2.


The function f is differentiable at all values of x (as is every
polynomial function). Also, there are no stationary points
between −4 and −3 or between −3 and 0. Similarly, there are no
stationary points between 0 and 1/2 or between 1/2 and 1.


Since f'(x) = 4x^2 + 10x − 6, we have
f'(−4) = 4(−4)^2 + 10(−4) − 6 = 64 − 40 − 6 = 18,
f'(0) = −6,
f'(1) = 4 + 10 − 6 = 8.


Since f' is positive at −4 and negative at 0, the stationary point
−3 is a local maximum.


Since f' is negative at 0 and positive at 1, the stationary point 1/2
is a local minimum.
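
Step 3 of the sample-point strategy is mechanical enough to express in a few lines of code. The Python sketch below is an illustration only; the function name classify_stationary_point is made up here. It assumes that the caller has already checked the conditions in step 2 (differentiability, and no other stationary points between the chosen points and the stationary point), because the code cannot check these for itself.

```python
def classify_stationary_point(f_prime, left, right):
    """Classify a stationary point from the signs of f' at two chosen points.

    left and right are x-values on either side of the stationary point.
    The conditions in step 2 of the strategy are assumed to hold.
    """
    d_left, d_right = f_prime(left), f_prime(right)
    if d_left > 0 and d_right < 0:
        return "local maximum"
    if d_left < 0 and d_right > 0:
        return "local minimum"
    if d_left * d_right > 0:          # same sign on both sides
        return "horizontal point of inflection"
    raise ValueError("f' is zero at a chosen point; choose different points")


def f_prime(x):
    """Derivative of the function in Example 9."""
    return 4 * x**2 + 10 * x - 6


print(classify_stationary_point(f_prime, -4, 0))   # stationary point -3
print(classify_stationary_point(f_prime, 0, 1))    # stationary point 1/2

# The function f(x) = x^3 from Figure 40(a), with derivative 3x^2.
print(classify_stationary_point(lambda x: 3 * x**2, -1, 1))
```

The first two calls use the same values −4, 0 and 1 as Example 9, so their results can be compared directly with the conclusions of that example.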
Figure 45 illustrates the reasons for the restrictions mentioned in the
strategy box above. Each of the three graphs shows a situation where you 
might be misled into thinking that a stationary point is a local minimum, 
whereas in fact it is a local maximum. 


In Figure 45(a), the function isn’t differentiable at all points between the 
chosen points and the stationary point, because it isn’t even defined at all 
such points. It is undefined (and has vertical asymptotes) at x = 4 and
x = 8.


In Figure 45(b), the function isn’t differentiable at all points between the 
chosen points and the stationary point. It isn’t differentiable at x = 4 and 
x = 8.


Finally, in Figure 45(c), there are further stationary points between the 
chosen points and the stationary point of interest. There are stationary 
points at x = 4 and x = 8.


Figure 45 The graphs of three functions 


You can try using the alternative way of applying the first derivative test 
in the next activity. 


Activity 18 Determining the nature of stationary points using the 
first derivative test, again 


Consider the function 
f(x) = x^4 − 2x^3.
(a) Find the stationary points of f.
Hint: to factorise f'(x), first notice that it has x^2 as a factor.


(b) By finding the value of f’ at appropriate points, determine whether 
each stationary point of f is a local maximum, local minimum or 
horizontal point of inflection. 




4.3 Uses of finding stationary points 


In this subsection you’ll see some situations where it can be useful to find, 
and sometimes determine the nature of, the stationary points of a function. 


Finding the vertex of a parabola 


In Unit 2 you saw some methods for finding the vertex of a u-shaped or 
n-shaped parabola. An alternative method is to use the fact that the 
vertex of a parabola is a stationary point. You’re asked to do this in the 
next activity, for a parabola that models a particular economic situation. 


The type of economic graph in this activity is different from the type that 
you considered in the ‘chocolate manufacturer’ example in Subsection 3.2. 
There you considered graphs of total cost against quantity of product 
made, whereas here you'll consider a graph of total profit against quantity 
of product made. 


Activity 19 Using differentiation to find the vertex of a parabola 


A bakery has found that its daily profit, p (in £), from making and selling 
cakes of a particular type can be modelled by the equation 


p = −0.006q^2 + 3.3q,
where q is the quantity of cakes sold. 


(Since the coefficient of q^2 is negative, the graph of p against q is an
n-shaped parabola, so, as the quantity of cakes sold increases, the profit
first increases and then decreases. The decrease might be caused by the
fact that, when the quantity of cakes increases beyond a certain number,
the limits of the bakery’s facilities mean that it becomes more expensive
to make them; for example, the bakery may have to pay its staff at higher
rates, for overtime or non-standard hours.)


(a) Find the formula for dp/dq.


(b) Use your answer to part (a) to find the stationary point of the 
parabola that is the graph of p against q. Hence write down the 
number of cakes that the bakery should make and sell per day if it 
wants to earn the maximum profit. 


(c) Find the daily profit earned by the bakery from these cakes if it sells 
the number of them found in part (b). 
Sketching the graph of a function


Another situation where it can be useful to find, and in this case determine 
the nature of, the stationary points of a function is when you want to 
sketch its graph. If you plot the stationary points, and indicate their 
natures, then you can often draw a reasonable sketch of the graph. 


It’s a good idea to also include on your sketch any x- or y-intercepts that
you can find and plot easily. It’s usually straightforward to find the 
y-intercept, if there is one, but often not so easy to find any x-intercepts. 


As an illustration, we’ll consider cubic functions, that is, functions of the 
form 


f(x) = ax^3 + bx^2 + cx + d,


where a, b, c and d are constants, with a ≠ 0. The graph of every cubic
function, in common with the graph of every polynomial function that 
isn’t a constant function, tends to plus or minus infinity at the left and 
right. You can find its shape between these extremes by finding its 
stationary points, if it has any. 


The derivative of every cubic function is a quadratic function, because 
when you differentiate the formula for a cubic function you reduce the 
power of x in each term by 1. So to find the stationary points of a cubic 
function you have to solve a quadratic equation, and hence there are at 
most two stationary points. 


Example 10 Sketching the graph of a cubic function
Sketch the graph of the function f(x) = (1/3)x^3 − x^2 − 3x + 2.
Solution
Find the stationary points, including the y-coordinates.
The derivative is

f'(x) = x^2 − 2x − 3 = (x + 1)(x − 3).
Solving the equation f'(x) = 0 gives

(x + 1)(x − 3) = 0;


that is,
x = −1 or x = 3.
When x = −1,
y = f(−1)
= (1/3)(−1)^3 − (−1)^2 − 3(−1) + 2
= −1/3 − 1 + 3 + 2
= 11/3.
When x = 3,
y = f(3)
= (1/3) × 3^3 − 3^2 − 3 × 3 + 2
= 9 − 9 − 9 + 2
= −7.


So the stationary points are (−1, 11/3) and (3, −7).


Determine the nature of the stationary points.


x             (−∞, −1)    −1    (−1, 3)    3    (3, ∞)
x + 1             −        0       +       +      +
x − 3             −        −       −       0      +
f'(x)             +        0       −       0      +
slope of f        /        –       \       –      /


By the first derivative test, (−1, 11/3) is a local maximum, and (3, −7)
is a local minimum.


Find the y-intercept. Don’t try to find the x-intercepts in this
case, as this isn’t easy, because it involves solving the equation
(1/3)x^3 − x^2 − 3x + 2 = 0.


When x = 0,
y = f(0) = 2.


Mark the stationary points and the y-intercept on a pair of
labelled axes, and indicate the nature of the stationary points, as
shown on the left below. Hence sketch the graph, as shown on the
right. Label the points that you’ve marked with their coordinates,
and label the graph with its equation.
You’ve now seen the graphs of several cubic functions in this unit. These
include the one in the example above, and the graphs of f(x) = x^3 and
g(x) = −x^3 in Figure 40 on page 263. Before you go on, it’s useful for you
to get to know the general shapes of the graphs of cubic functions. You 
can do that in the next activity. 


Activity 20 Investigating the graphs of cubic functions


Open the Investigating cubic functions applet. Experiment with changing 
the values of the constants a, b, c and d in the equation 

y = ax^3 + bx^2 + cx + d, and observe the shapes of the resulting graphs.
Try to answer the following questions.


(a) How does the sign of a, the coefficient of x^3, affect the properties of
the graph?
(b) How many x-intercepts can a cubic function have?


Finally, read through the list of properties of graphs of cubic functions 
given below this activity, and check that these properties seem to describe 
the graphs that you’ve seen. 


In Activity 20 you should have seen evidence of the following facts. 


Properties of graphs of cubic functions 
The graph of every cubic function has the following properties. 
• There are two, one or no stationary points.


• Apart from at any stationary points and in the interval between
them if there are two,


  - the graph slopes up from left to right if the coefficient of x^3
    in the rule of the function is positive


  - the graph slopes down from left to right if the coefficient of
    x^3 is negative.


• If there are two stationary points, then one is a local maximum
and the other is a local minimum, and the graph slopes the other
way in the interval between them.


• If there is one stationary point, then it is a horizontal point of
inflection.


• There are three, two or one x-intercepts.
• There is one y-intercept.


• The graph tends to plus or minus infinity for large positive and
large negative values of x.




Whether a cubic function has two, one or no stationary points depends on 
whether the quadratic equation that you solve to find the stationary points 
has two, one or no solutions. 
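
The number of solutions of a quadratic equation is determined by the sign of its discriminant, so the number of stationary points of a cubic function can be read off from its coefficients. The following Python sketch illustrates this; the function name is made up for the illustration, and the third example, f(x) = x^3 + x, is chosen here rather than taken from the text.

```python
from fractions import Fraction


def count_stationary_points(a, b, c):
    """Number of stationary points of f(x) = a*x^3 + b*x^2 + c*x + d.

    The derivative is f'(x) = 3a*x^2 + 2b*x + c, whose discriminant is
    (2b)^2 - 4(3a)c. The constant d does not affect the answer.
    """
    if a == 0:
        raise ValueError("a must be non-zero for a cubic function")
    disc = (2 * b) ** 2 - 4 * (3 * a) * c
    if disc > 0:
        return 2
    if disc == 0:
        return 1
    return 0


# f(x) = (1/3)x^3 - x^2 - 3x + 2, the function sketched in Example 10.
print(count_stationary_points(Fraction(1, 3), -1, -3))
# f(x) = x^3, from Figure 40(a).
print(count_stationary_points(1, 0, 0))
# f(x) = x^3 + x (an example chosen for this sketch).
print(count_stationary_points(1, 0, 1))
```

Exact fractions are used for the coefficient 1/3 so that the test for a zero discriminant isn't affected by rounding.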


It’s useful to keep the properties in the box above in mind when you’re 
sketching the graph of a cubic function. 


Activity 21 Sketching the graphs of cubic functions 


Sketch the graphs of the following cubic functions. 
(a) fle) = ir’ 4g? — 127g — 4 
(b) f(x) = ir? —27 4+ 32 


Hint for part (a): factorise the derivative f'(x) into a product of three 
factors, one of which is —1. 


Finding the greatest and least values of a function on an 
interval of the form [a,b] 


A third situation where it can be useful to find the stationary points of a 
function is when you want to find the greatest or least value that a 
function takes on a particular interval of the form [a,b], where a and b are 
real numbers. This is sometimes a helpful thing to do when the function 
models a real-life situation. You'll see some examples of this in Unit 7. 


Usually the function that you’re working with is continuous on the 
interval — this means that it has no discontinuities in the interval, or, 
informally, that you can draw its graph over the whole interval without 
taking your pen off the paper. (In particular, for a function to be 
continuous on an interval, it must also be defined for all values in the 
interval.) 
If the function is also differentiable at all values in the interval except
possibly the endpoints, then its greatest or least value on the interval 
occurs either at a stationary point in the interval or at an endpoint of the 
interval. For example, in Figure 46 the greatest value of the function on 
the interval shown occurs at the right endpoint of the interval, and the 
least value occurs at the local minimum in the interval. 


Figure 46 The greatest and least values of a particular function on a
particular interval


This fact gives the following strategy, which is demonstrated in 
Example 11. 


Strategy: 
To find the greatest or least value of a function on 
an interval of the form [a, b]


(This strategy is valid when the function is continuous on the interval, 
and differentiable at all values in the interval except possibly the 
endpoints.) 


1. Find the stationary points of the function. 


2. Find the values of the function at any stationary points inside the 
interval, and at the endpoints of the interval. 


3. Find the greatest or least of the function values found. 
Example 11
Find the greatest and least values of the function
f(x) = −(1/3)x^3 + 2x^2 + 5x on the interval [−7, 7].

Solution
The function is
f(x) = −(1/3)x^3 + 2x^2 + 5x.
Find its stationary points.
Its derivative is
f'(x) = −x^2 + 4x + 5
= −(x^2 − 4x − 5)
= −(x − 5)(x + 1).
So its stationary points are −1 and 5.


Find the values of f at any stationary points inside the interval,
and at the endpoints of the interval.


Both stationary points are inside the interval, and the values of f at
these points are


f(−1) = −(1/3)(−1)^3 + 2(−1)^2 + 5(−1) = −8/3 = −2⅔,
f(5) = −(1/3) × 5^3 + 2 × 5^2 + 5 × 5 = 100/3 = 33⅓.
The values of f at the endpoints of the interval are
f(−7) = −(1/3)(−7)^3 + 2(−7)^2 + 5(−7) = 532/3 = 177⅓,
f(7) = −(1/3) × 7^3 + 2 × 7^2 + 5 × 7 = 56/3 = 18⅔.


Find the greatest and least of the values of f found.


The greatest value of f on the interval [−7, 7] is 177⅓, and the least
value is −2⅔.
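
The three steps of the strategy translate directly into code. The Python sketch below is an illustration only: the function name extreme_values is made up here, the stationary points are supplied by hand rather than computed, and the function is assumed to satisfy the conditions stated in the strategy box (continuous on the interval, and differentiable except possibly at the endpoints).

```python
from fractions import Fraction


def f(x):
    """The function from Example 11, evaluated with exact fractions."""
    return -Fraction(1, 3) * x**3 + 2 * x**2 + 5 * x


def extreme_values(func, stationary_points, a, b):
    """Return (greatest, least) values of func on the interval [a, b].

    Only stationary points strictly inside the interval are considered,
    together with the two endpoints.
    """
    candidates = [a, b] + [x for x in stationary_points if a < x < b]
    values = [func(x) for x in candidates]
    return max(values), min(values)


greatest, least = extreme_values(f, [-1, 5], -7, 7)
print(greatest, least)

# The same function on the interval [0, 7]: the stationary point -1
# lies outside this interval, so it is ignored.
print(extreme_values(f, [-1, 5], 0, 7))
```

The first result can be compared with the values 532/3 and −8/3 found in Example 11.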


The graph of the function in Example 11 is shown in Figure 47. You can 
see that its greatest and least values on the interval [—7, 7] do seem to be 
the values found in the example, at least roughly. 


When you use the strategy in the box above to find the greatest or least
value of a function on an interval of the form [a, b], it’s important to make
sure that any stationary points that you consider are inside the interval!
For example, if Example 11 had asked for the greatest and least values of
the function on the interval [0, 7], rather than on [−7, 7], then the solution
would have considered the values of the function at the stationary point 5
and at the endpoints of the interval, and would have ignored the stationary
point −1.


Figure 47 The graph of the function f(x) = −(1/3)x^3 + 2x^2 + 5x
Activity 22 Finding the greatest and least values of functions on
intervals of the form [a, b]


Find the greatest and least values of each of the following functions on the 
stated interval. Check that your answers seem reasonable by plotting the 
graphs of the functions using the module computer algebra system. 


(a) f(x) = 32° — x? — 8x + 1 on the interval [—3, 2] 
(b) f(x) = 3x? — 22 + 5 on the interval [—1, 4] 


Activity 23 Thinking about the strategy 


Give an example of a function and a closed interval such that the strategy 
in the box on page 276 doesn’t work, because the function isn’t 
differentiable (but is defined) at a value in the interval. Draw a 
sketch-graph to illustrate your answer. 


Hint: some graphs of functions with values at which they aren’t 
differentiable are given in Figure 9 on page 217. 


5 Differentiating twice 


In this section you'll learn what it means to differentiate a function more 
than once, and why this can be a helpful thing to do. 


5.1 Second derivatives 


As you’ve seen, the derivative of a function is itself a function, which is 
sometimes called the derived function. This means that the derivative of a 
function can itself be differentiated. The function that is obtained by 
differentiating a function twice in this way is called the second 
derivative (or second derived function) of the original function. In 
Lagrange notation, the second derivative of a function f is denoted by f''
(read as ‘f double prime’ or ‘f double dash’ or ‘f double dashed’). 


For example, consider the function f(x) = x^3. The derivative of f is
f'(x) = 3x^2.
Differentiating this derivative gives the second derivative of f:


f''(x) = 6x.
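
For polynomial functions, differentiating twice just means applying the same rule twice. The Python sketch below illustrates this for f(x) = x^3. It is not part of the module software: it assumes a polynomial is stored as a list of coefficients, with the coefficient of x^k at position k, and the function name differentiate is made up for the illustration.

```python
def differentiate(coefficients):
    """Differentiate a polynomial given as a list of coefficients.

    coefficients[k] is the coefficient of x^k, so the term c*x^k becomes
    k*c*x^(k-1), and the constant term disappears.
    """
    return [k * c for k, c in enumerate(coefficients)][1:]


cubic = [0, 0, 0, 1]             # f(x) = x^3
first = differentiate(cubic)     # coefficients of f'(x)
second = differentiate(first)    # coefficients of f''(x)

print(first)    # [0, 0, 3], that is, 3x^2
print(second)   # [0, 6], that is, 6x
```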


Activity 24 Finding a second derivative 


Find the second derivative of the function f(x) = 2x4 + 3x? + 2. 


In Leibniz notation, the second derivative of y with respect to x is denoted
by d^2y/dx^2 (read as ‘d two y by d x squared’). For example, if
y = 4x^3 + 5x^2, then

dy/dx = 12x^2 + 10x, and d^2y/dx^2 = 24x + 10.

To see the thinking behind this notation for a second derivative, notice
that the second derivative of y with respect to x can be written in Leibniz
notation as


d/dx (dy/dx).


Historically, this was, rather loosely, abbreviated to


(d/dx)^2 y, and then to d^2y/dx^2.

The domain of the second derivative of a function consists of all the values 
of x at which its first derivative is differentiable. We say that the original 
function is twice-differentiable at such values of x.


Once you’ve differentiated a function twice, you can go on to differentiate 
it three times, and then four times, and so on. The derivatives obtained in 
this way are called the third derivative, the fourth derivative, and so 
on. In Lagrange notation, the third derivative is denoted by f''' or f^(3), the
fourth derivative is denoted by f^(4), and the higher derivatives are denoted
in the same way. In Leibniz notation, the third derivative is denoted by
d^3y/dx^3, the fourth derivative is denoted by d^4y/dx^4, and so on.


The domain of the third derivative of a function consists of all the values 
of x at which the second derivative of the function is differentiable, that is, 
all the values of x at which the original function is three times 
differentiable. Similarly a function can be four times differentiable at 
a value of x, and so on. It can also be differentiable infinitely many 
times at a value of x. For example, every polynomial function is 
differentiable infinitely many times at every value of x. 


In Unit 11 you'll see one reason why the third and higher derivatives of 
functions are useful, but for now you won’t need to work with derivatives 
beyond second derivatives. 
Activity 25 Thinking about derivatives and second derivatives


Try to work out which of the graphs B, C and D below shows the derivative 
of the function in graph A, and which shows its second derivative. 




The derivative of a function is sometimes called its first derivative, to
emphasise that it’s not one of the higher derivatives. This is the reason for
the word ‘first’ in the name ‘first derivative test’, which you met on
page 265. Later in this subsection, you’ll meet the second derivative test,
which has the same purpose as the first derivative test (determining the
nature of a stationary point) but involves the second derivative of a
function.


You know that the derivative of a function tells you the gradients of the 
graph of the function. So the second derivative of a function tells you the 
gradients of the graph of the first derivative. This means that the second 
derivative also gives you information about the shape of the graph of the 
original function. 


To see this, suppose, for example, that there is an interval on which the
second derivative of a particular function is positive. By the
increasing/decreasing criterion, this means that the first derivative is
increasing on that interval. In other words, the gradient of the graph of the
original function is increasing on that interval. This means that the shape
of the graph of the original function on the interval must be something like
one of the shapes shown in Figure 48. All of these shapes are sections of
graphs with increasing gradient. For example, in the second shape the
gradient increases from large negative values to small negative values.


Figure 48 Sections of graphs with increasing gradient 


A section of a graph shaped like one of the diagrams in Figure 48 is said to
be concave up. More precisely, a graph is concave up on an interval if the
tangents to the graph on that interval lie below the graph, as illustrated in
Figure 49. (Of course the tangents may cross the graph outside the
interval.) Informally, a section of a graph is concave up if it looks like a
cross-section of a bowl that’s the right way up, or part of such a
cross-section. One way to remember the meaning of ‘concave up’ is to
think of the rhyme ‘Concave up, like a cup’.


Figure 49 A section of a graph is concave up when the tangents lie below the graph


So, in summary, if the second derivative of a function is positive on an
interval, then the graph of the function is concave up on the interval.


For example, Figure 50 shows the graphs of the function f(x) = x^2, its
derivative f'(x) = 2x and its second derivative f''(x) = 2. The domain of
each of these functions consists of all of the real numbers. The second
derivative f'' is positive on the whole domain, which tells you that the first
derivative f' is increasing on the whole domain, which in turn tells you
that the graph of the function f is concave up on its whole domain.


Figure 50 The graphs of (a) $f(x) = x^2$ (b) its derivative $f'(x) = 2x$
(c) its second derivative $f''(x) = 2$


Figure 52 A section of a graph is concave down when the tangents lie above the graph


Now suppose that there’s an interval on which the second derivative of a
particular function is negative. By the increasing/decreasing criterion, this 
means that the first derivative is decreasing on that interval. In other 
words, the gradient of the graph of the original function is decreasing on 
that interval. This means that the shape of the graph of the original 
function on the interval must be something like one of the shapes shown in 
Figure 51. 


Figure 51 Sections of graphs with decreasing gradient 


A section of a graph shaped like one of the diagrams in Figure 51 is said to 
be concave down. More precisely, a graph is concave down on an interval 
if the tangents to the graph on that interval lie above the graph, as 
illustrated in Figure 52. (Again the tangents may cross the graph outside 
the interval.) Informally, a section of a graph is concave down if it looks 
like a cross-section of a bowl that’s the wrong way up, or part of such a 
cross-section. One way to remember the meaning of ‘concave down’ is to 
think of the rhyme ‘Concave down, like a frown’. 


So, in summary, if the second derivative of a function is negative on an 
interval, then the graph of the function is concave down on the interval. 


For example, Figure 53 shows the graphs of the function $f(x) = x^3$, its
derivative $f'(x) = 3x^2$ and its second derivative $f''(x) = 6x$. The domain
of each of these functions consists of all of the real numbers. The second
derivative $f''$ is negative on the interval $(-\infty, 0)$ and positive on the
interval $(0, \infty)$, which tells you that the first derivative $f'$ is decreasing on
the interval $(-\infty, 0)$ and increasing on the interval $(0, \infty)$, which in turn
tells you that the function $f$ is concave down on the interval $(-\infty, 0)$ and
concave up on the interval $(0, \infty)$.


Figure 53 The graphs of (a) $f(x) = x^3$ (b) its derivative $f'(x) = 3x^2$
(c) its second derivative $f''(x) = 6x$


In some texts, the words convex and concave are used instead of concave
up and concave down, respectively.


A point where a graph changes from concave up to concave down or vice 
versa is called a point of inflection. For example, the graph of $f(x) = x^3$
has a point of inflection at the origin. The tangents to a graph on one side 
of a point of inflection lie above the graph, and the tangents to the graph 
on the other side lie below the graph, as illustrated in Figure 54. The 
tangent at the point of inflection itself crosses the graph at that point. You 
can determine whether a point on the graph of a function is a point of 
inflection by checking whether its second derivative changes sign at that 
point. 


If the gradient of a graph at a point of inflection is zero, then the point is a
horizontal point of inflection; this is a type of stationary point that you
met earlier. Otherwise the point is a slant point of inflection. For
example, the graph of $y = x^3$ and the graph in Figure 55(a) have
horizontal points of inflection, and the graphs in Figures 55(b) and (c)
have slant points of inflection.


Figure 54 Tangents to a graph at and near a point of inflection


Figure 55 The graphs of (a) $y = x^3 - 6x^2 + 12x - 6$
(b) $y = x^3 - 6x^2 + 11x - 4$ (c) $y = x^3 - 6x^2 + 13x - 8$


If the graph of a function is concave up or concave down on an interval, 
then we also say that the function itself is concave up or concave down on 
the interval, respectively. (In general, for simplicity we often say that a 
function has a certain property if its graph has that property, and vice 
versa.) So the relationship between the sign of the second derivative of a 
function and the shape of the graph of the function can be summarised as 
follows. 


Concave up/concave down criterion
If $f''(x)$ is positive for all $x$ in an interval $I$, then $f$ is concave up on $I$.


If $f''(x)$ is negative for all $x$ in an interval $I$, then $f$ is concave down
on $I$.
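
The criterion can be explored numerically. The following Python sketch is illustrative only: the helper names (`poly_derivative`, `poly_eval`, `concavity_on_samples`) are hypothetical, polynomials are stored as lists of coefficients, and checking the sign of the second derivative at a few sample points is evidence, not proof, that the sign holds on the whole interval.

```python
def poly_derivative(coeffs):
    """Differentiate a polynomial; coeffs[k] is the coefficient of x**k."""
    return [k * c for k, c in enumerate(coeffs)][1:]


def poly_eval(coeffs, x):
    """Evaluate the polynomial with the given coefficients at x."""
    return sum(c * x**k for k, c in enumerate(coeffs))


def concavity_on_samples(coeffs, xs):
    """Report the sign pattern of the second derivative at the sample points xs."""
    second = poly_derivative(poly_derivative(coeffs))
    values = [poly_eval(second, x) for x in xs]
    if all(v > 0 for v in values):
        return "concave up"
    if all(v < 0 for v in values):
        return "concave down"
    return "undetermined"


cube = [0, 0, 0, 1]  # f(x) = x^3, so f''(x) = 6x
print(concavity_on_samples(cube, [-3, -2, -1, -0.5]))
print(concavity_on_samples(cube, [0.5, 1, 2, 3]))
```

For $f(x) = x^3$ the samples to the left of 0 all give a negative second derivative and the samples to the right all give a positive one, in line with the discussion of Figure 53.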

Figure 56 (a) A local maximum (b) a local minimum


Activity 26 Using the concave up/concave down criterion


Consider the function f(x) = $x — 2x? + 11x? — 182. 
(a) Find the second derivative of this function. 


(b) Hence show that this function is concave up on the whole of its 
domain (all the real numbers). 


Hint for part (b): start by completing the square in your answer to 
part (a). 


Activity 27 Using the concave up/concave down criterion again 


Consider the function f(x) = a4 — 22”. 


(a) Find the second derivative of this function. 


(b) Hence show that this function is concave down on the interval (—2, 2). 


The connection between the sign of the second derivative of a function and 
the shape of the graph of the function gives a useful test, called the second 
derivative test, for determining the nature of some stationary points. This 
test can often, but not always, be used as an alternative to the first 
derivative test. It’s based on the fact that the graph of a function is 
concave down at and around a local maximum, and concave up at and 
around a local minimum, as illustrated in Figure 56. In other words, by 
the concave up/concave down criterion, the second derivative of a function 
is negative at and around a local maximum, and positive at and around a 
local minimum. 


Second derivative test (for determining the nature of a 
stationary point) 


If, at a stationary point of a function, the value of the second 
derivative of the function is 


• negative, then the stationary point is a local maximum


• positive, then the stationary point is a local minimum.


One way to remember the second derivative test is to keep in mind that a 
negative sign corresponds to a frown shape (a negative mood) and a 
positive sign corresponds to a smile shape (a positive mood). You might
like to think of the emoticons ☹ and ☺.


Alternatively, you might prefer simply to remember that a local maximum
corresponds to decreasing gradient, and hence a negative second derivative, 
whereas a local minimum corresponds to increasing gradient, and hence a 
positive second derivative. This is illustrated in Figure 57. 


Figure 57 The change in the gradient at and around a local maximum
and a local minimum 


Example 12 Determining the nature of stationary points using the
second derivative test
Consider the function


$f(x) = x^3 - 3x^2 + 1$.
(a) Find the stationary points of $f$.


(b) Use the second derivative test to determine the nature of each
stationary point of $f$.


Solution
(a) We have
$f'(x) = 3x^2 - 6x$
$= 3x(x - 2)$.
Solving the equation $f'(x) = 0$ gives
$3x(x - 2) = 0$;
that is,


$x = 0$ or $x = 2$.


Hence the stationary points of $f$ are 0 and 2.
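
For part (b), the second derivative test amounts to evaluating $f''$ at each stationary point. The following is a minimal Python sketch, assuming the function $f(x) = x^3 - 3x^2 + 1$ and the stationary points 0 and 2 found in part (a); the helper names are hypothetical, and the code does not itself check that the point supplied is stationary.

```python
def poly_derivative(coeffs):
    """Differentiate a polynomial; coeffs[k] is the coefficient of x**k."""
    return [k * c for k, c in enumerate(coeffs)][1:]


def poly_eval(coeffs, x):
    """Evaluate the polynomial with the given coefficients at x."""
    return sum(c * x**k for k, c in enumerate(coeffs))


def second_derivative_test(coeffs, x0):
    """Classify a stationary point x0 (assumed stationary) by the sign of f''(x0)."""
    second = poly_derivative(poly_derivative(coeffs))
    value = poly_eval(second, x0)
    if value < 0:
        return "local maximum"
    if value > 0:
        return "local minimum"
    return "inconclusive"  # f''(x0) = 0: the test gives no information


f = [1, 0, -3, 1]  # f(x) = 1 - 3x^2 + x^3
for x0 in (0, 2):
    print(x0, second_derivative_test(f, x0))
```

Here $f''(x) = 6x - 6$, so $f''(0) = -6$ is negative and $f''(2) = 6$ is positive: the stationary point 0 is a local maximum and the stationary point 2 is a local minimum.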

Figure 58 The graph of $f(x) = x^3 - 3x^2 + 1$


The graph of the function in Example 12 is shown in Figure 58. You can
see that its stationary points are of the types determined in the example. 


Activity 28 Determining the nature of stationary points using the 
second derivative test 


Consider the function 
fas 2r — Šg? + 2r. 
(a) Find the stationary points of f, including the y-coordinates. 


(b) Use the second derivative test to determine the nature of each 
stationary point. 


(c) Using your answers to parts (a) and (b), sketch the graph of f. 


Unfortunately, if the value of the second derivative of a function at a 
stationary point is zero, rather than either negative or positive, then you 
can’t use the second derivative test to determine the nature of the 
stationary point. The stationary point might be a local maximum, a local 
minimum, a horizontal point of inflection, or none of these. 


For example, Figure 59 shows the graphs of the functions $f(x) = x^4$,
$g(x) = -x^4$, $h(x) = x^3$ and $k(x) = -x^3$. Each of these four functions has a
stationary point at the origin, and at each of these stationary points the 
value of the second derivative is zero, as you might like to check. The 
stationary points on the four graphs are a local minimum, a local maximum 
and two horizontal points of inflection, respectively. 


Figure 59 The graphs of (a) $f(x) = x^4$ (b) $g(x) = -x^4$ (c) $h(x) = x^3$
(d) $k(x) = -x^3$

So the second derivative test isn’t as widely applicable as the first 
derivative test. If you’re trying to use it to find the nature of a stationary 
point, and find that you can’t because the value of the second derivative at 
the stationary point is zero, then try the first derivative test instead. 
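
As an illustration of falling back on the first derivative test, the Python sketch below classifies the stationary point at the origin for each of the four functions in Figure 59. It is a sketch under stated assumptions: the function name `first_derivative_test` is hypothetical, the derivatives are entered by hand, and the test points $-1$ and $1$ are valid only because none of these functions has another stationary point between them.

```python
def first_derivative_test(df, left, right):
    """Classify a stationary point from the sign of the derivative df on each side.

    Assumes the function is differentiable, and has no other stationary
    points, between `left` and `right`.
    """
    l, r = df(left), df(right)
    if l > 0 and r < 0:
        return "local maximum"
    if l < 0 and r > 0:
        return "local minimum"
    if l * r > 0:
        return "horizontal point of inflection"
    return "undetermined"


derivatives = {
    "x^4": lambda x: 4 * x**3,
    "-x^4": lambda x: -4 * x**3,
    "x^3": lambda x: 3 * x**2,
    "-x^3": lambda x: -3 * x**2,
}

for name, df in derivatives.items():
    print(name, first_derivative_test(df, -1, 1))
```

The results agree with the text: a local minimum, a local maximum and two horizontal points of inflection, respectively.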


5.2 Rates of change of rates of change 


Earlier in the unit you saw that if two variables $x$ and $y$ are related, then
the derivative $dy/dx$ is the rate of change of $y$ with respect to $x$. In the
same way, the second derivative $d^2y/dx^2$ is the rate of change of $dy/dx$
with respect to $x$.


You saw that it’s particularly useful to think of derivatives as rates of 
change when you’re dealing with mathematical models of real-life 
situations. 


For example, consider the graph in Figure 60, which is a repeat of
Figure 32(b) on page 251. It’s the displacement-time graph for a man
walking along a straight path, and has equation

$s = 6t - \tfrac{5}{8}t^2$,


where $t$ is the time in hours that the man has been walking, and $s$ is his
displacement in kilometres from his starting point.


Figure 60 A displacement-time graph, with equation $s = 6t - \tfrac{5}{8}t^2$, for a
man’s walk along a straight path


As you know, the gradient of the displacement-time graph of an object is
the object’s velocity. So, as you saw in Example 5 on page 253, the man’s
velocity $v$ is given by the equation

$v = \dfrac{ds}{dt} = 6 - \tfrac{5}{4}t$.


The graph of this equation is shown in Figure 61. A graph like this, which 
shows an object’s velocity plotted against time, is known as a 
velocity—time graph. 


Figure 61 The velocity-time graph, with equation $v = 6 - \tfrac{5}{4}t$,
corresponding to Figure 60


The velocity-time graph in Figure 61 shows that the man’s velocity
decreases as time goes on, as expected. For example, his velocity at the
start of his walk is 6 km h⁻¹, and his velocity one hour later is 4.75 km h⁻¹.


The gradient of the velocity—time graph is the rate of change of the man’s 
velocity with respect to time. The velocity-time graph is a straight line, so
its gradient is the same at all points, and you can see from its equation,
$v = 6 - \tfrac{5}{4}t$, that the numerical value of the gradient is $-\tfrac{5}{4} = -1.25$.
Because the quantity on the horizontal axis (time) is measured in hours,
and the quantity on the vertical axis (velocity) is measured in kilometres
per hour, the gradient is measured in kilometres per hour per hour. So the
gradient is −1.25 kilometres per hour per hour. This means that in each
hour that passes, the man’s velocity decreases by 1.25 kilometres per hour.


The units ‘kilometres per hour per hour’ are more usually called
‘kilometres per hour squared’, and are usually abbreviated as km/h² or
km h⁻² (both read as ‘kilometres per hour squared’). Similarly, ‘metres per
second per second’ are known as ‘metres per second squared’ and
abbreviated as m/s² or m s⁻², and so on.


The rate of change of velocity with respect to time is called acceleration, 
and is usually denoted by the variable a. Like displacement and velocity, 
acceleration can be constant, or it can take different values at different 
times. For example, you’ve just seen that the man on his walk has a
constant acceleration of −1.25 km h⁻². Also like displacement and velocity,
the values that acceleration takes can be positive, negative or zero. The 
fact that the man’s acceleration is negative at all times corresponds to the 
fact that his velocity is decreasing throughout his walk. Similarly, a 
positive acceleration would correspond to increasing velocity. 


Because the gradient of the velocity—time graph of an object is the 
acceleration of the object, the important facts in the box on page 252 can 
be extended as follows. 


Relationships between displacement, velocity and 
acceleration 


Suppose that an object is moving in a straight line. If t is the time 
that has elapsed since some chosen point in time, and s, v and a are 
the displacement, velocity and acceleration of the object, respectively, 
then 


$v = \dfrac{ds}{dt}, \quad a = \dfrac{dv}{dt} = \dfrac{d^2 s}{dt^2}.$

(Time, displacement, velocity and acceleration can be measured in 
any suitable units, as long as they are consistent.) 


An example of a set of consistent units is seconds for time, metres for 
displacement, metres per second for velocity, and metres per second 
squared for acceleration. 


In the discussion above, the value of the man’s acceleration was found by
using the fact that the equation relating his velocity to time is the
equation of a straight line. An alternative way to find his acceleration is to
use differentiation, as illustrated in the example below. The advantage of
this method is, of course, that it can be used even when the acceleration
isn’t constant.


Figure 62 The displacement-time graph for an object falling from rest


Example 13 Using differentiation to find an acceleration


Suppose that a man walks along a straight path, and his displacement
$s$ (in kilometres) from his starting point at time $t$ (in hours) after he
began his walk is given by the equation $s = 6t - \tfrac{5}{8}t^2$. Use
differentiation to show that the man’s acceleration is constant, and to
find its value.


Solution


The man’s displacement $s$ (in km) at time $t$ (in hours) is given by

$s = 6t - \tfrac{5}{8}t^2$.

Hence his velocity $v$ (in km h⁻¹) at time $t$ (in hours) is given by

$v = \dfrac{ds}{dt}$
$= 6 - \tfrac{5}{8} \times 2t$
$= 6 - \tfrac{5}{4}t$.


It follows that his acceleration $a$ (in km h⁻²) at time $t$ (in hours) is
given by

$a = \dfrac{dv}{dt} = -\tfrac{5}{4}$.


That is, the man’s acceleration is constant, with value −1.25 km h⁻².
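
The two differentiations in Example 13 can be mirrored in a few lines of Python. This is an illustrative sketch, not part of the module's method: it assumes the displacement $s = 6t - \tfrac{5}{8}t^2$, represents each polynomial by a list of coefficients, and uses hypothetical helper names.

```python
def poly_derivative(coeffs):
    """Differentiate a polynomial; coeffs[k] is the coefficient of t**k."""
    return [k * c for k, c in enumerate(coeffs)][1:]


def poly_eval(coeffs, t):
    """Evaluate the polynomial with the given coefficients at t."""
    return sum(c * t**k for k, c in enumerate(coeffs))


displacement = [0.0, 6.0, -5 / 8]          # s = 6t - (5/8)t^2, in km
velocity = poly_derivative(displacement)   # v = ds/dt, in km per hour
acceleration = poly_derivative(velocity)   # a = dv/dt, in km per hour squared

print(velocity)                # coefficients of v = 6 - 1.25t
print(acceleration)            # a single constant coefficient
print(poly_eval(velocity, 1))  # velocity one hour into the walk
```

The acceleration list contains a single constant term, which reflects the fact that the acceleration does not depend on $t$.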


When you’re working with displacement, velocity and acceleration along a 
straight line, it’s important to keep in mind that displacement is different 
from distance, and velocity is different from speed. As you know, distance 
is the magnitude of displacement, and speed is the magnitude of velocity. 
For example, an object that has a displacement of −10 metres from some
reference point has a distance of 10 metres from that point, and, similarly,
an object that has a velocity of −2 m s⁻¹ has a speed of 2 m s⁻¹.
Unfortunately, the magnitude of acceleration is also known as acceleration, 
but this doesn’t usually cause any confusion in practice. 


Consider the graph in Figure 62, which is the displacement—time graph for 
an object falling from rest, where the positive direction along the line of 
motion has been taken to be upwards and the reference point has been 
taken to be the point from which the object falls. Notice that the object’s 
displacement from its starting point is decreasing, but its distance from its 
starting point is increasing. In general, if an object’s displacement from a 
chosen reference point is always negative over some time interval, then 
saying that its displacement is decreasing over the time interval is the same 
as saying that its distance from the reference point is increasing over the 
time interval, and vice versa. 


Similarly, if an object’s velocity is always negative over some time interval, 
then saying that its velocity is decreasing over the time interval is the same 
as saying that its speed is increasing, and vice versa. For example, in 
Figure 62 the velocity (the gradient of the graph) is decreasing, but the 
speed of the object is increasing. 
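
The distinction between velocity and speed can be seen in a short numerical sketch. It assumes the falling-object model $s = -4.9t^2$ (so $v = -9.8t$), with time in seconds, displacement in metres and the upward direction positive; the function names are hypothetical.

```python
def velocity(t):
    """Velocity v = ds/dt for s = -4.9 t^2 (negative means downwards)."""
    return -9.8 * t


def speed(t):
    """Speed is the magnitude of velocity."""
    return abs(velocity(t))


times = [1, 2, 3]
velocities = [velocity(t) for t in times]
speeds = [speed(t) for t in times]

for t, v, sp in zip(times, velocities, speeds):
    print(t, v, sp)
```

Reading down the output, the velocities become more negative (so the velocity is decreasing) while the speeds grow.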


Activity 29 Using differentiation to find an acceleration 
Suppose that the displacement of an object along a straight line is given by 
the equation 
s = 4t? — 5t? + 16t + 8, 
where t is the time in seconds and s is the displacement in metres. 


(a) Find an equation giving the object’s velocity in terms of time, and an 
equation giving the object’s acceleration in terms of time. 


(b) Is the object’s acceleration constant? Explain how you know. 
(c) Find the object’s velocity and acceleration at time 6 seconds. 


(d) This part of the question is about the time interval from 5 seconds to 8 
seconds (not including the endpoints of this time interval). 


(i) Over this time interval, is the object’s velocity increasing, 
decreasing or neither? Explain how you know. 


(ii) Over the time interval, is the object’s velocity always positive, 
always negative or neither? 


(iii) Over the time interval, is the object’s speed increasing, decreasing 
or neither? Explain how you know. 


Activity 30 Using differentiation to find another acceleration 


In Subsection 3.1 you saw that the displacement of an object falling from 
rest is given by the equation 


$s = -4.9t^2$,


where $t$ is the time in seconds that the object has been falling, and $s$ is its
displacement in metres from its starting point (where the positive direction 
is upwards). This is assuming that the effect of air resistance is negligible. 


Show that the acceleration of the object is constant, and find its value. 


Activity 30 shows that the model for the motion of a falling object given in 
Subsection 3.1 is based on constant acceleration. It’s a good model for the 
motion of any object falling from rest, provided that the effect of air 
resistance is negligible (which it is if the object is fairly compact and the
fall is fairly short). The constant acceleration of a falling object in the
absence of air resistance is known as the acceleration due to gravity. 


If the falling object is near the surface of the Earth, then its acceleration 
due to gravity is approximately the value found in Activity 30. However, 
the value of the acceleration due to gravity varies slightly depending on the 
location of the object on the surface of the Earth. This is because of 
variations in the shape and density of the Earth. For example, the 


magnitude of the acceleration due to gravity in London is 9.812 m s⁻², to
three decimal places, whereas in Sydney it is 9.797 m s⁻².


In the fall of 1972 President Nixon announced that the rate of 
increase of inflation was decreasing. This was the first time a sitting 
president used the third derivative to advance his case for reelection. 


(Hugo Rossi (1996) ‘Mathematics is an edifice, not a toolbox’, Notices 
of the American Mathematical Society, vol. 43, no. 10.) 


Learning outcomes 


After studying this unit, you should be able to: 


• understand the meaning of differentiation
• understand the idea of differentiation from first principles
• use both Lagrange and Leibniz notation for derivatives
• differentiate power functions
• use the constant multiple rule and the sum rule for derivatives
• use differentiation to determine whether a function is increasing, or decreasing, on an interval
• find the stationary points of some functions
• determine the nature of some stationary points using either the first or second derivative test
• use differentiation to determine whether a function is concave up, or concave down, on an interval
• understand and work with derivatives as rates of change in mathematical models
• understand and work with the relationships between displacement, velocity and acceleration.


Solutions to activities 


Solution to Activity 1 

In the first half-hour the man walks about three 
kilometres, and in the final half-hour he walks about 
one and a half kilometres. 


Solution to Activity 2 
(a) Large negative gradient: graph E.
(b) Small negative gradient: graph C.
(c) Zero gradient: graph A.
(d) Small positive gradient: graph B.
(e) Large positive gradient: graph D.


Solution to Activity 3 

The point on the graph with $x$-coordinate 0.9 has
$y$-coordinate $0.9^2 = 0.81$. The gradient of the line
through $(1, 1)$ and $(0.9, 0.81)$ is

$\dfrac{\text{rise}}{\text{run}} = \dfrac{0.81 - 1}{0.9 - 1} = \dfrac{-0.19}{-0.1} = 1.9$.

So another approximate value for the gradient of the
graph at $(1, 1)$ is 1.9.
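
The same rise-over-run calculation can be repeated for second points closer and closer to $(1, 1)$. The sketch below is illustrative; `secant_gradient` and `square` are hypothetical names, and the function $y = x^2$ is the one used in this activity.

```python
def secant_gradient(f, x0, x1):
    """Gradient (rise over run) of the line through (x0, f(x0)) and (x1, f(x1))."""
    return (f(x1) - f(x0)) / (x1 - x0)


def square(x):
    return x * x


for x1 in (0.9, 0.99, 0.999):
    print(x1, secant_gradient(square, 1, x1))
```

Up to rounding in floating-point arithmetic, the printed gradients are 1.9, 1.99 and 1.999, which suggests that the gradient of the graph at $(1, 1)$ is 2.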


Solution to Activity 4 


(What you should have found in this activity is 
discussed in the text after the activity.) 


Solution to Activity 5 


By formula (1), the gradient of the graph of $y = x^2$
at the point with $x$-coordinate 3 is

$2 \times 3 = 6$.

Similarly, the gradient at the point with
$x$-coordinate $-1.5$ is

$2 \times (-1.5) = -3$.


Solution to Activity 6 
(a) Multiplying out $(x + h)^4$ gives
$(x + h)^4$
$= (x + h)(x + h)^3$
$= (x + h)(x^3 + 3x^2h + 3xh^2 + h^3)$
$= x(x^3 + 3x^2h + 3xh^2 + h^3)$
$\quad + h(x^3 + 3x^2h + 3xh^2 + h^3)$
$= x^4 + 3x^3h + 3x^2h^2 + xh^3$
$\quad + x^3h + 3x^2h^2 + 3xh^3 + h^4$
$= x^4 + 4x^3h + 6x^2h^2 + 4xh^3 + h^4$.


(b) The difference quotient for the function
$f(x) = x^4$ at $x$ is

$\dfrac{f(x + h) - f(x)}{h} = \dfrac{(x + h)^4 - x^4}{h}$.

By part (a),

$\dfrac{f(x + h) - f(x)}{h}$
$= \dfrac{x^4 + 4x^3h + 6x^2h^2 + 4xh^3 + h^4 - x^4}{h}$
$= \dfrac{4x^3h + 6x^2h^2 + 4xh^3 + h^4}{h}$
$= 4x^3 + 6x^2h + 4xh^2 + h^3$.

Each of the terms in the final expression above,
except the first term, contains the factor $h$.
Hence, as $h$ gets closer and closer to zero, each
of the terms except the first term gets closer
and closer to zero. So the value of the whole
expression gets closer and closer to the value of
the first term, $4x^3$. That is, the formula for the
derivative is

$f'(x) = 4x^3$.
(c) By the formula found in part (b), the gradient
of the graph of the function $f(x) = x^4$ at the
point with x-coordinate i is 

133 bo i 

(The graph of f(x) = x4 is shown below.) 




Solution to Activity 7 
(a) $f(x) = x^8$, so
$f'(x) = 8x^7$.

(b) $f(x) = x^5$, so
$f'(x) = 5x^4$.

() fe) = 4 = 279, 90 
f(z) 35 = -5 
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(A) f(z) =2/, s0 
f'(a) = $a? = 8a, 


(£) f(a) = 2°/?, so 
fia) = 50°”. 
(This derivative can also be written as 


f' (x) = 3V2°.) 


his form of the derivative is preferable to 


= 4/2, which could be misread as 


f(a) 
f(a) = Bvz) 
f 


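(h) $f(x) = \dfrac{1}{x^8} = x^{-8}$, so
$f'(x) = -8x^{-9} = -\dfrac{8}{x^9}$.
(i) $y = \dfrac{1}{x^{1/4}} = x^{-1/4}$, so
$\dfrac{dy}{dx} = -\tfrac{1}{4}x^{-5/4} = -\dfrac{1}{4x^{5/4}}$.
(j) $y = x^{1/3}$, so
$\dfrac{dy}{dx} = \tfrac{1}{3}x^{-2/3} = \dfrac{1}{3x^{2/3}}$.
(k) $y = \dfrac{1}{\sqrt{x}} = \dfrac{1}{x^{1/2}} = x^{-1/2}$, so
$\dfrac{dy}{dx} = -\tfrac{1}{2}x^{-3/2} = -\dfrac{1}{2x^{3/2}}$.
(l) $y = \sqrt[3]{x^5} = x^{5/3}$, so
$\dfrac{dy}{dx} = \tfrac{5}{3}x^{2/3}$.
(m) $y = x^{2/7}$, so
$\dfrac{dy}{dx} = \tfrac{2}{7}x^{-5/7} = \dfrac{2}{7x^{5/7}}$.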
(n) f(z) =27?, so 
2 
fms = -z5 
_ 1 _ 1⁄3 
(0) f(z) = ae so 
1 1 1 1 
! —4/3 
f (x)= -73 ss 74/3 374/38 

Solution to Activity 8
By the solution to Example 2(c), the derivative of
$y = \sqrt{x}$ is

$\dfrac{dy}{dx} = \dfrac{1}{2\sqrt{x}}$.

So the gradient of the graph at the point with
$x$-coordinate 4 is

$\dfrac{1}{2\sqrt{4}} = \dfrac{1}{2 \times 2} = \dfrac{1}{4}$.

(Check: the gradient of the line in Figure 22, in the
margin beside this activity, appears to be about $\tfrac{1}{4}$.)

Solution to Activity 9
(a) $f(x) = 5x^3$, so
$f'(x) = 5 \times 3x^2 = 15x^2$.


(c) f(x) = 2/7 = 27", so 
fi(e) =2x 5a? 


1 
~ gil? 
oe 
F 
(d) $f(x) = 6x$, so
$f'(x) = 6 \times 1 = 6$.
(e) f@) =F Jm 80 
EE 
(f) FO) = = 2, s0 
FE = 2 x (1)? 
O 2 
=-5 


da _ 19x lp-2/3 
dr 3 
1 
=X -z3 
_ 4 
= R 




Solution to Activity 10 
The derivative of the function $f(x) = 3x^2$ is
$f'(x) = 3 \times 2x = 6x$.
So the gradient of this function f at the point with 
x-coordinate 2 is 


6 x 2 = 12. 


Solution to Activity 11 

(a) $f(x) = 6x^2 - 2x + 1$, so
$f'(x) = 12x - 2$.

(b) $f(x) = \tfrac{2}{3}x^3 + 2x^2 + x - 4$, so
$f'(x) = \tfrac{2}{3} \times 3x^2 + 2 \times 2x + 1$
$= 2x^2 + 4x + 1$.
(c) $f(x) = 5x + 1$, so
$f'(x) = 5$.


(Notice that in general the derivative of
$f(x) = mx + c$ is $f'(x) = m$, as you’d expect.)


(d) g(t) = 4t + vt = 4t + t!/?, so 


NIe NlRENMIlR ble 


= 


(e) $f(x) = (1 + x^2)(1 + 3x)$
$= 1 + 3x + x^2 + 3x^3$,
so
$f'(x) = 3 + 2x + 9x^2$.
(f) $k(u) = (u + 3)^2$
$= u^2 + 6u + 9$,
so
$k'(u) = 2u + 6$.
(g) $f(x) = 30(x^{3/2} - x)$, so
$f'(x) = 30\left(\tfrac{3}{2}x^{1/2} - 1\right)$
$= 30\left(\tfrac{3}{2}\sqrt{x} - 1\right)$
$= 45\sqrt{x} - 30$
$= 15(3\sqrt{x} - 2)$.
(h) $f(x) = x(x^{3/2} - x) = x^{5/2} - x^2$, so
$f'(x) = \tfrac{5}{2}x^{3/2} - 2x$.
(Note that this function can’t be differentiated 
in a similar way to the function in part (g), 
because the multiplier of the brackets, x, isn’t a 
constant, and hence the constant multiple rule 
doesn’t apply.) 


(i) $y = \dfrac{(t - 2)(t + 5)}{t}$
$= \dfrac{t^2 + 3t - 10}{t}$
$= t + 3 - 10t^{-1}$,
so
$\dfrac{dy}{dt} = 1 - 10 \times (-1)t^{-2}$
$= 1 + 10t^{-2}$
$= 1 + \dfrac{10}{t^2}$.
(The final answer can also be expressed as
$\dfrac{t^2 + 10}{t^2}$.)
(j) $y = \dfrac{x + \sqrt{x}}{x^2}$
$= \dfrac{x + x^{1/2}}{x^2}$
$= x^{-1} + x^{-3/2}$,
so
$\dfrac{dy}{dx} = -x^{-2} - \tfrac{3}{2}x^{-5/2}$
$= -\dfrac{1}{x^2} - \dfrac{3}{2x^{5/2}}$.


(The final answer can be expressed in various
ways. For example, you could combine the
fractions and simplify the result like this:

$-\dfrac{1}{x^2} - \dfrac{3}{2x^{5/2}} = -\dfrac{2x}{2x^3} - \dfrac{3x^{1/2}}{2x^3} = -\dfrac{2x + 3\sqrt{x}}{2x^3}$.)


(k) $y = (x^{1/3} + 1)(x^{1/3} + 5x)$
$= x^{2/3} + 5x^{4/3} + x^{1/3} + 5x$,
so
$\dfrac{dy}{dx} = \tfrac{2}{3}x^{-1/3} + \tfrac{20}{3}x^{1/3} + \tfrac{1}{3}x^{-2/3} + 5$
$= \dfrac{2}{3x^{1/3}} + \dfrac{20x^{1/3}}{3} + \dfrac{1}{3x^{2/3}} + 5$.

Solution to Activity 12
(a) The displacement $s$ (in metres) of the object at
time $t$ (in seconds) is

$s = -4.9t^2$.

Hence the velocity $v$ (in m s⁻¹) of the object at
time $t$ (in seconds) is

$v = \dfrac{ds}{dt} = -9.8t$.

(b) When $t = 3$,

$v = -9.8 \times 3 = -29.4$.

So the velocity of the object 3 seconds into its
fall is −29.4 m s⁻¹.

(c) When the object has a speed of 15 m s⁻¹, it has
a velocity of −15 m s⁻¹. So the time $t$ at which
the object is moving at this speed is given by

$-15 = -9.8t$.

This equation has solution

$t = \dfrac{15}{9.8} = 1.5$ (to 1 d.p.).

So the object takes about 1.5 seconds to reach a
speed of 15 m s⁻¹.
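
The arithmetic in parts (b) and (c) can be checked with a short Python sketch. It assumes the velocity formula $v = -9.8t$ from part (a); the function names are hypothetical.

```python
def velocity(t):
    """Velocity in metres per second after t seconds, from v = -9.8t."""
    return -9.8 * t


def time_to_reach_speed(speed):
    """Time in seconds at which the falling object reaches the given speed.

    The velocity is then -speed, so -speed = -9.8t, giving t = speed / 9.8.
    """
    return speed / 9.8


print(velocity(3))
print(round(time_to_reach_speed(15), 1))
```

The first value agrees with −29.4 m s⁻¹ up to floating-point rounding, and the second rounds to 1.5 seconds.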


(As a check on parts (b) and (c), you can see that
the gradients of the graph of the equation
$s = -4.9t^2$, which is given in Figure 34 and repeated
below, at the points with time-coordinates 1.5 and 3
seconds are about −15 m s⁻¹ and about −30 m s⁻¹,
respectively. The tangents at these points are shown
below.)


[Figure: the graph of $s = -4.9t^2$, with displacement plotted against time, $t$ (seconds), and tangents at $t = 1.5$ and $t = 3$]


Solution to Activity 13 


(a) The equation for $c$ in terms of $q$ is

$c = 3000 + 4q + \tfrac{1}{125}q^2$.

Hence

$m = \dfrac{dc}{dq} = 4 + \tfrac{2}{125}q$.

That is, an equation for $m$ in terms of $q$ is

$m = 4 + \tfrac{2}{125}q$.

(b) Substituting $q = 300$ into the equation found in
part (a) gives

$m = 4 + \tfrac{2}{125} \times 300 = 8.8$.

So the marginal cost per kilogram of making
extra chocolate when the amount of chocolate
already being made is 300 kilograms is £8.80.

(c) The marginal cost is equal to the selling price
when

$m = 16$.

By part (a), the value of $q$ for which this
happens is given by

$4 + \tfrac{2}{125}q = 16$.

Solving this equation gives

$\tfrac{2}{125}q = 12$;

that is,

$q = 750$.

So the company should make 750 kg of
chocolate per week.
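
Parts (b) and (c) can be verified exactly using Python's `fractions` module. This is a sketch under the assumption that the marginal cost is $m = 4 + \tfrac{2}{125}q$, as found in part (a); the function names are hypothetical.

```python
from fractions import Fraction

RATE = Fraction(2, 125)  # coefficient of q in the marginal cost m = 4 + (2/125)q


def marginal_cost(q):
    """Marginal cost in pounds per kilogram when q kilograms are being made."""
    return 4 + RATE * q


def quantity_for_marginal_cost(m):
    """Quantity q in kilograms at which the marginal cost equals m."""
    return (m - 4) / RATE


print(float(marginal_cost(300)))
print(quantity_for_marginal_cost(16))
```

Working with exact fractions avoids any rounding, so the results can be compared directly with the values 8.8 and 750 found above.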


(As a check on parts (b) and (c), you can see 
that the gradients of the graph in Figure 36 at 
the points with g-coordinates 300 and 750 
kilograms are about £9 per kilogram and about 
£16 per kilogram, respectively. The tangents at 
these points are shown below.) 


[Figure: the graph of $c = 3000 + 4q + \tfrac{1}{125}q^2$, with total cost, $c$ (£), plotted against quantity, $q$ (kg), and tangents at $q = 300$ and $q = 750$]

Solution to Activity 14
(a) $f(x) = \tfrac{2}{3}x^3 - 8x^2 + 30x - 36$, so

$f'(x) = 2x^2 - 16x + 30$
$= 2(x^2 - 8x + 15)$
$= 2(x - 3)(x - 5)$.


(b) When $x$ is less than 3, the values of $x - 3$ and
$x - 5$ are both negative, and hence the value of
$f'(x) = 2(x - 3)(x - 5)$ is positive.

Similarly, when $x$ is greater than 5, the values of
$x - 3$ and $x - 5$ are both positive, and hence the
value of $f'(x) = 2(x - 3)(x - 5)$ is also positive.

Therefore, by the increasing/decreasing
criterion, the function $f$ is increasing on each of
the intervals $(-\infty, 3)$ and $(5, \infty)$.

When $x$ is in the interval $(3, 5)$, the value of
$x - 3$ is positive and the value of $x - 5$ is
negative, and hence the value of
$f'(x) = 2(x - 3)(x - 5)$ is negative.
Therefore, by the increasing/decreasing
criterion, the function $f$ is decreasing on the
interval $(3, 5)$.

(The graph of $f(x) = \tfrac{2}{3}x^3 - 8x^2 + 30x - 36$ is as
follows.)
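
The sign argument in part (b) can be spot-checked numerically. The sketch below assumes the factorised derivative $f'(x) = 2(x - 3)(x - 5)$ from part (a); the names `f_prime` and `trend` are hypothetical, and sampling a point in each interval illustrates the reasoning rather than replacing it.

```python
def f_prime(x):
    """Derivative f'(x) = 2(x - 3)(x - 5), in factorised form."""
    return 2 * (x - 3) * (x - 5)


def trend(x):
    """Describe the behaviour of f at x from the sign of f'(x)."""
    d = f_prime(x)
    if d > 0:
        return "increasing"
    if d < 0:
        return "decreasing"
    return "stationary"


for x in (0, 3, 4, 5, 6):
    print(x, f_prime(x), trend(x))
```

The sample points 0 and 6 lie in the intervals where $f$ is increasing, 4 lies in the interval where $f$ is decreasing, and 3 and 5 are the points where the derivative is zero.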

Solution to Activity 15
(a) $f(x) = x^3 - 3x^2 + 4x + 3$, so

$f'(x) = 3x^2 - 6x + 4$
$= 3(x^2 - 2x) + 4$
$= 3((x - 1)^2 - 1) + 4$
$= 3(x - 1)^2 + 1$.

(b) For every value of $x$, the expression $(x - 1)^2$ is
non-negative, and hence the expression
$3(x - 1)^2 + 1$ is positive.
That is, the derivative $f'(x)$ is positive on the
interval $(-\infty, \infty)$. Therefore, by the
increasing/decreasing criterion, the function $f$ is
increasing on the interval $(-\infty, \infty)$.

(The graph of $f(x) = x^3 - 3x^2 + 4x + 3$ is shown
below.)


Solution to Activity 16
Here $f(x) = x^3 - x^2 - 2x$, so

$f'(x) = 3x^2 - 2x - 2$.

The expression for $f'(x)$ cannot be factorised easily,
so we solve the equation $f'(x) = 0$ using the
quadratic formula. This gives

$x = \dfrac{-(-2) \pm \sqrt{(-2)^2 - 4 \times 3 \times (-2)}}{2 \times 3}$
$= \dfrac{2 \pm \sqrt{28}}{6}$
$= \dfrac{2 \pm 2\sqrt{7}}{6}$
$= \dfrac{1 \pm \sqrt{7}}{3}$
$= 1.22$ or $-0.55$ (to 2 d.p.).

That is, the stationary points are 1.22 and −0.55, to
two decimal places.


Solution to Activity 17
(a) Here $f(x) = 3x^4 - 2x^3 - 9x^2 + 7$, so

$f'(x) = 12x^3 - 6x^2 - 18x$
$= 6x(2x^2 - x - 3)$
$= 6x(2x - 3)(x + 1)$.

Solving the equation $f'(x) = 0$ gives

$6x(2x - 3)(x + 1) = 0$;

that is,

$x = 0$ or $x = \tfrac{3}{2}$ or $x = -1$.

Hence the stationary points of $f$ are 0, $\tfrac{3}{2}$
and −1.

(b) A table of signs for $f'(x) = 6x(2x - 3)(x + 1)$ is
given below. (The table is split to make it fit in
a narrow text column.)

$x$ | $(-\infty, -1)$ | $-1$ | $(-1, 0)$
$6x$ | − | − | −
$2x - 3$ | − | − | −
$x + 1$ | − | 0 | +
$f'(x)$ | − | 0 | +
slope of $f$ | ↘ | → | ↗

$x$ | $0$ | $(0, \tfrac{3}{2})$ | $\tfrac{3}{2}$ | $(\tfrac{3}{2}, \infty)$
$6x$ | 0 | + | + | +
$2x - 3$ | − | − | 0 | +
$x + 1$ | + | + | + | +
$f'(x)$ | 0 | − | 0 | +
slope of $f$ | → | ↘ | → | ↗

The table shows that −1 is a local minimum, 0
is a local maximum, and $\tfrac{3}{2}$ is a local minimum.


(You can see these stationary points on the graph
below.)


[Figure: the graph of $y = 3x^4 - 2x^3 - 9x^2 + 7$]


Solution to Activity 18 


(a) $f(x) = x^4 - 2x^3$, so
$f'(x) = 4x^3 - 6x^2$
$= 2x^2(2x - 3)$.
Solving the equation $f'(x) = 0$ gives
$2x^2(2x - 3) = 0$;
that is,

$x = 0$ or $x = \tfrac{3}{2}$.

Hence the stationary points of $f$ are 0 and $\tfrac{3}{2}$.


(b) To determine the nature of the stationary
points, apply the method described on page 268.


Consider the values −1, 1 and 2. The values −1
and 1 lie on each side of the stationary point 0,
and the values 1 and 2 lie on each side of the
stationary point $\tfrac{3}{2}$.


The function $f$ is differentiable at all values of $x$
(as is every polynomial function).


Also, there are no stationary points between −1
and 0 or between 0 and 1. Similarly, there are
no stationary points between 1 and $\tfrac{3}{2}$ or
between $\tfrac{3}{2}$ and 2.


Since $f'(x) = 2x^2(2x - 3)$, we have
$f'(-1) = 2(-1)^2(2(-1) - 3) = 2(-2 - 3) = -10$
$f'(1) = 2 \times 1^2(2 \times 1 - 3) = 2(2 - 3) = -2$
$f'(2) = 2 \times 2^2(2 \times 2 - 3) = 8(4 - 3) = 8$.

Since the derivative is negative at both −1
and 1, the stationary point 0 is a horizontal
point of inflection.


Since the derivative is negative at 1 and positive
at 2, the stationary point $\tfrac{3}{2}$ is a local minimum.


(You can see these stationary points on the graph
below.)


[Figure: the graph of $y = x^4 - 2x^3$]


Solution to Activity 19 


(a) 


Here 
p = —0.006q? + 3.3q, 
so 
OF —0.012q + 3.3. 
dq 
The stationary point q is given by 
dp 
dg = U; 
that is, 
—0.012q + 3.3 = 0, 
so i 
q= 0012 = 275. 


So if the bakery wants to earn the maximum
profit, then it should make and sell 275 cakes
per day.

Substituting $q = 275$ into the formula for p gives
$p = -0.006 \times 275^2 + 3.3 \times 275 = 453.75$,
so the daily profit is £453.75.
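
As a check on the arithmetic, the calculation can be repeated with exact fractions. This is a minimal Python sketch under stated assumptions: the names `A`, `B`, `profit`, `marginal_profit` and `q_star` are illustrative and do not come from the module.

```python
from fractions import Fraction

A = Fraction("-0.006")  # coefficient of q^2 in the profit formula
B = Fraction("3.3")     # coefficient of q in the profit formula

def profit(q):
    """Daily profit p (in pounds) from making and selling q cakes."""
    return A * q**2 + B * q

def marginal_profit(q):
    """Derivative dp/dq = 2Aq + B."""
    return 2 * A * q + B

# Solving dp/dq = 0 for q gives q = -B / (2A).
q_star = -B / (2 * A)
assert marginal_profit(q_star) == 0

print(q_star)                 # 275
print(float(profit(q_star)))  # 453.75

# The profit is smaller one cake either side of the stationary point.
assert profit(q_star - 1) < profit(q_star) > profit(q_star + 1)
```

Exact fractions are used here only to avoid floating-point rounding in the division 3.3/0.012; ordinary floats give the same answers to within rounding error.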

(The graph of the equation $p = -0.006q^2 + 3.3q$
is shown below. You can see that the stationary
point appears to be roughly (275, 450), which
accords with the results found above.)
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[Graph of $p = -0.006q^2 + 3.3q$, with q from 0 to 500 on the horizontal axis]

Solution to Activity 20

(a) If the coefficient of $x^3$ is positive, then the
graph tends to infinity for large positive values
of x, and tends to minus infinity for large
negative values of x.

If the coefficient of $x^3$ is negative, then the
graph tends to minus infinity for large positive
values of x, and tends to infinity for large
negative values of x.

(This is because for large positive and large
negative values of x, the value of the term in $x^3$,
the dominant term, overwhelms the values of
the other terms.)

(b) A cubic function can have one, two or three
x-intercepts, as illustrated below.

[Three sketch graphs of cubic functions, with one, two and three x-intercepts]

Solution to Activity 21

(a) The function is
$f(x) = -\frac{1}{3}x^3 - 4x^2 - 12x - 4$.
So the derivative is
$f'(x) = -x^2 - 8x - 12$
$= -(x^2 + 8x + 12)$
$= -(x + 6)(x + 2)$.
Solving the equation $f'(x) = 0$ gives
$-(x + 6)(x + 2) = 0$;
that is,

$x = -6$ or $x = -2$.

When $x = -6$,
$y = f(-6)$
$= -\frac{1}{3}(-6)^3 - 4(-6)^2 - 12(-6) - 4$
$= 72 - 144 + 72 - 4$
$= -4$.

When $x = -2$,
$y = f(-2)$
$= -\frac{1}{3}(-2)^3 - 4(-2)^2 - 12(-2) - 4$
$= \frac{8}{3} - 16 + 24 - 4$
$= \frac{20}{3}$.

So the stationary points are $(-6, -4)$ and
$(-2, \frac{20}{3})$.

A table of signs for $f'(x) = -(x + 6)(x + 2)$ is
given below.

x             (-∞, -6)    -6    (-6, -2)    -2    (-2, ∞)
-1               -         -        -        -       -
x + 6            -         0        +        +       +
x + 2            -         -        -        0       +
f'(x)            -         0        +        0       -
slope of f       ↘         →        ↗        →       ↘

By the first derivative test, $(-6, -4)$ is a local
minimum, and $(-2, \frac{20}{3})$ is a local maximum.

When $x = 0$,
$y = f(0) = -4$.


A sketch of the graph is shown below.

(b) The function is
$f(x) = \frac{1}{9}x^3 - x^2 + 3x$.
So the derivative is
$f'(x) = \frac{1}{3}x^2 - 2x + 3$
$= \frac{1}{3}(x^2 - 6x + 9)$
$= \frac{1}{3}(x - 3)^2$.
Solving the equation $f'(x) = 0$ gives
$\frac{1}{3}(x - 3)^2 = 0$;
that is,
$x = 3$.
When $x = 3$,
$y = f(3) = \frac{1}{9} \times 3^3 - 3^2 + 3 \times 3 = 3$.
So there is just one stationary point, (3, 3).

Since $f'(x)$ is positive everywhere except at
$x = 3$, it follows, by the first derivative test,
that (3, 3) is a horizontal point of inflection and
also, by the increasing/decreasing criterion, that
f is increasing to the left and the right of the
stationary point.

When $x = 0$,
$y = f(0) = 0$.

A sketch of the graph is shown below.

Solution to Activity 22

(a) Here
$f(x) = \frac{1}{3}x^3 - x^2 - 8x + 1$.
This gives
$f'(x) = x^2 - 2x - 8 = (x - 4)(x + 2)$,
so the stationary points are $-2$ and 4.

Only the stationary point $-2$ is inside the
interval, and the value of f at this point is
$f(-2) = \frac{1}{3}(-2)^3 - (-2)^2 - 8(-2) + 1$
$= -\frac{8}{3} - 4 + 16 + 1$
$= \frac{31}{3} = 10\frac{1}{3}$.
The values of f at the endpoints of the interval
are
$f(-3) = \frac{1}{3}(-3)^3 - (-3)^2 - 8(-3) + 1$
$= -9 - 9 + 24 + 1$
$= 7$
and
$f(2) = \frac{1}{3} \times 2^3 - 2^2 - 8 \times 2 + 1$
$= \frac{8}{3} - 4 - 16 + 1$
$= -\frac{49}{3} = -16\frac{1}{3}$.

The greatest value of f on the interval $[-3, 2]$
is $10\frac{1}{3}$, and the least value is $-16\frac{1}{3}$.

(The graph of f is below.)
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(b) 




Here
$f(x) = 3x^2 - 2x + 5$.
This gives
$f'(x) = 6x - 2 = 2(3x - 1)$,
so the only stationary point is $\frac{1}{3}$.

This stationary point is inside the interval, and
the value of f at this point is
$f(\frac{1}{3}) = 3(\frac{1}{3})^2 - 2(\frac{1}{3}) + 5$
$= \frac{1}{3} - \frac{2}{3} + 5$
$= 4\frac{2}{3}$.
The values of f at the endpoints of the interval
are
$f(-1) = 3(-1)^2 - 2(-1) + 5$
$= 3 + 2 + 5$
$= 10$
and
$f(4) = 3 \times 4^2 - 2 \times 4 + 5$
$= 48 - 8 + 5$
$= 45$.

The greatest value of f on the interval $[-1, 4]$
is 45, and the least value is $4\frac{2}{3}$.
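
The procedure used in both parts of this solution (evaluate f at the endpoints and at any stationary points inside the interval, then compare) can be sketched in Python as follows. The helper name `extremes_on_interval` is an illustrative assumption, and the sketch assumes that f is differentiable throughout the interval and that the stationary points have already been found by hand.

```python
from fractions import Fraction as F

def extremes_on_interval(f, stationary_points, a, b):
    """Return (greatest, least) value of f on [a, b].

    Candidates are the two endpoints together with those stationary
    points that lie strictly inside the interval.
    """
    candidates = [a, b] + [x for x in stationary_points if a < x < b]
    values = [f(x) for x in candidates]
    return max(values), min(values)

def f_a(x):
    """Part (a): f(x) = (1/3)x^3 - x^2 - 8x + 1."""
    return F(1, 3) * x**3 - x**2 - 8 * x + 1

def f_b(x):
    """Part (b): f(x) = 3x^2 - 2x + 5."""
    return 3 * x**2 - 2 * x + 5

# Part (a): stationary points -2 and 4, interval [-3, 2].
print(extremes_on_interval(f_a, [F(-2), F(4)], F(-3), F(2)))
# Part (b): stationary point 1/3, interval [-1, 4].
print(extremes_on_interval(f_b, [F(1, 3)], F(-1), F(4)))
```

The first call returns the pair (31/3, -49/3) and the second returns (45, 14/3), in agreement with the mixed-number values found above.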


(The graph of f is below.) 


Solution to Activity 23 


An example is the function f(x) = |x| and the
interval [-1, 1]. This function is continuous on this
interval. Its least value on the interval is 0, which 
occurs when x = 0, but 0 is neither a stationary 
point of the function nor an endpoint of the interval. 
The function is not differentiable when x = 0. 


Solution to Activity 24 


Here
$f(x) = 2x^4 + 3x^2 + x$,
so
$f'(x) = 8x^3 + 6x + 1$,
and
$f''(x) = 24x^2 + 6$.
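
Differentiating a polynomial term by term is mechanical, so the two derivatives can be checked with a few lines of Python. In this sketch a polynomial is represented as a list of coefficients, with the entry at index k holding the coefficient of $x^k$; this representation and the name `differentiate` are assumptions made for illustration.

```python
def differentiate(coeffs):
    """Differentiate a polynomial given as a coefficient list.

    coeffs[k] is the coefficient of x^k, so the term c*x^k contributes
    k*c*x^(k-1) to the derivative; the constant term drops out.
    """
    return [k * c for k, c in enumerate(coeffs)][1:]

f = [0, 1, 3, 0, 2]      # f(x) = x + 3x^2 + 2x^4
f1 = differentiate(f)    # first derivative
f2 = differentiate(f1)   # second derivative

print(f1)  # [1, 6, 0, 8]  i.e. 1 + 6x + 8x^3
print(f2)  # [6, 0, 24]    i.e. 6 + 24x^2
```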


Solution to Activity 25 

The derivative of the function in graph A is shown 
in graph C. 

The second derivative of the function in graph A is 
shown in graph D. 


(There are various ways to work this out. For
example, notice that the function in graph A has a
stationary point when x is approximately 0. So the
graph of its derivative must cross or touch the
x-axis when x is approximately 0, which tells you
that the derivative must be the function in graph C.
Alternatively, notice that the function in graph A is
decreasing for values of x just less than 0. So its
derivative must be negative for these values of x,
which again tells you that the derivative must be
the function in graph C. You can use similar
arguments to work out that the derivative of the
function in graph C is the function in graph D.)


(Graph B shows the third derivative of the function 
in graph A.) 


Solution to Activity 26 


(a) We have
$f(x) = \frac{1}{6}x^4 - 2x^3 + 11x^2 - 18x$,
so
$f'(x) = \frac{2}{3}x^3 - 6x^2 + 22x - 18$,
and
$f''(x) = 2x^2 - 12x + 22$.
To show that f is concave up on all of its
domain, which is all the real numbers, we show
that $f''$ is positive on the interval $(-\infty, \infty)$.
One way to do that is by completing the square,
as follows.
$f''(x) = 2x^2 - 12x + 22$
$= 2(x^2 - 6x) + 22$
$= 2((x - 3)^2 - 9) + 22$
$= 2(x - 3)^2 - 18 + 22$
$= 2(x - 3)^2 + 4$.
The first term in this expression for $f''(x)$ is
greater than or equal to zero for every value
of x, and the second term is positive, so $f''(x)$ is
positive for every value of x.

It follows, by the concave up/concave down
criterion, that f is concave up on $(-\infty, \infty)$.
(The graph of f is shown below.)

[Graph of $y = \frac{1}{6}x^4 - 2x^3 + 11x^2 - 18x$]
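
The completed-square form can be checked independently. The sketch below is illustrative only (the function names are assumptions): it confirms that $2x^2 - 12x + 22$ and $2(x - 3)^2 + 4$ agree, and that the quadratic has a negative discriminant, which is another way of seeing that it never reaches zero.

```python
def second_derivative(x):
    """f''(x) = 2x^2 - 12x + 22, as found in the solution."""
    return 2 * x**2 - 12 * x + 22

def completed_square(x):
    """The same expression after completing the square."""
    return 2 * (x - 3) ** 2 + 4

# Two quadratics that agree at three or more distinct points are identical,
# so agreement on these eleven integers confirms the algebra.
assert all(second_derivative(x) == completed_square(x) for x in range(-5, 6))

# A negative discriminant with a positive leading coefficient means the
# quadratic is positive for every real x.
discriminant = (-12) ** 2 - 4 * 2 * 22
print(discriminant)          # -32
print(second_derivative(3))  # 4, the least value of f''
```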




Solution to Activity 27 


(a) We have
$f(x) = \frac{1}{12}x^4 - 2x^2$,
so
$f'(x) = \frac{1}{3}x^3 - 4x$,
and
$f''(x) = x^2 - 4$.

(b) Factorising the expression for $f''(x)$ found in
part (a) gives
$f''(x) = (x + 2)(x - 2)$.
When x is in the interval $(-2, 2)$, the value of
$x + 2$ is positive and the value of $x - 2$ is
negative, so the value of $f''(x)$ is negative.
It follows, by the concave up/concave down
criterion, that f is concave down on $(-2, 2)$.
(The graph of f is shown below.)


Solution to Activity 28 
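(a) We have
$f(x) = \frac{2}{3}x^3 - \frac{5}{2}x^2 + 2x$,
so
$f'(x) = 2x^2 - 5x + 2$
$= (2x - 1)(x - 2)$.
Hence the stationary points of f are $\frac{1}{2}$ and 2.

The y-coordinate of the stationary point $\frac{1}{2}$ is
given by
$y = \frac{2}{3} \times (\frac{1}{2})^3 - \frac{5}{2} \times (\frac{1}{2})^2 + 2 \times \frac{1}{2}$
$= \frac{1}{12} - \frac{5}{8} + 1$
$= \frac{11}{24}$.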
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The y-coordinate of the stationary point 2 is 
given by 


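$y = \frac{2}{3} \times 2^3 - \frac{5}{2} \times 2^2 + 2 \times 2$
$= \frac{16}{3} - 10 + 4$
$= -\frac{2}{3}$.
Hence the stationary points are $(\frac{1}{2}, \frac{11}{24})$ and
$(2, -\frac{2}{3})$.
We have
$f''(x) = 4x - 5$,
so
$f''(\frac{1}{2}) = 4 \times \frac{1}{2} - 5 = 2 - 5 = -3$
and
$f''(2) = 4 \times 2 - 5 = 8 - 5 = 3$.

Hence, by the second derivative test, $(\frac{1}{2}, \frac{11}{24})$ is a
local maximum and $(2, -\frac{2}{3})$ is a local minimum.

The y-intercept of f is 0.
A sketch of the graph of f is shown below.

[Graph of $y = \frac{2}{3}x^3 - \frac{5}{2}x^2 + 2x$]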
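
The second derivative test applied in this solution can be sketched in Python with exact fractions. The function names below are illustrative assumptions; the test is reported as inconclusive when the second derivative is zero, since in that case it gives no information.

```python
from fractions import Fraction as F

def f(x):
    """f(x) = (2/3)x^3 - (5/2)x^2 + 2x."""
    return F(2, 3) * x**3 - F(5, 2) * x**2 + 2 * x

def f1(x):
    """First derivative: 2x^2 - 5x + 2."""
    return 2 * x**2 - 5 * x + 2

def f2(x):
    """Second derivative: 4x - 5."""
    return 4 * x - 5

def second_derivative_test(x):
    """Classify a stationary point x of f using the sign of f''(x)."""
    assert f1(x) == 0, "x must be a stationary point"
    if f2(x) < 0:
        return "local maximum"
    if f2(x) > 0:
        return "local minimum"
    return "inconclusive"

for x in (F(1, 2), F(2)):
    print(x, f(x), second_derivative_test(x))
```

Running the loop prints the coordinates 11/24 and -2/3 alongside the classifications, matching the stationary points $(\frac{1}{2}, \frac{11}{24})$ and $(2, -\frac{2}{3})$.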


Solution to Activity 29 


(a) 
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The displacement s metres of the object at time 
t seconds is given by 
$s = \frac{1}{3}t^3 - 5t^2 + 16t + 8$.

Hence the velocity $v$ m s$^{-1}$ of the object at time
t seconds is given by
$v = \frac{ds}{dt} = t^2 - 10t + 16$,
and the acceleration $a$ m s$^{-2}$ of the object at
time t seconds is given by
$a = \frac{dv}{dt} = 2t - 10$.


The object’s acceleration is not constant,
because the expression that gives the object’s
acceleration in terms of time is $2t - 10$, which is
not a constant (it contains the variable t).

When $t = 6$,
$v = 6^2 - 10 \times 6 + 16 = -8$,
and
$a = 2 \times 6 - 10 = 2$.

So at time 6 seconds, the velocity of the object
is $-8$ m s$^{-1}$ and its acceleration is 2 m s$^{-2}$.

(b) (i) When t is between 5 and 8 (exclusive), the
value of $a = 2t - 10$ is always positive.
Hence the acceleration of the object is
always positive in the given time interval,
so its velocity is increasing over this time
interval.

(ii) We have $v = t^2 - 10t + 16 = (t - 2)(t - 8)$.
When t is between 5 and 8 (exclusive), the
value of $t - 2$ is always positive and the
value of $t - 8$ is always negative, so the
value of $v = (t - 2)(t - 8)$ is always
negative. That is, the velocity of the object
is always negative over the given time
interval.

(iii) By parts (b)(i) and (b)(ii), over the given
time interval the object’s velocity is always
negative, and always increasing. So its
speed is decreasing over this time interval.


(The object’s displacement, velocity and 
acceleration are shown in the graphs below. 
Notice in particular that over the time 
interval from 5 seconds to 8 seconds 
(excluding the endpoints), the object’s 
velocity is negative and increasing. So the 
magnitude of its velocity is decreasing; 
that is, its speed is decreasing.) 
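
The conclusions of this solution can be spot-checked numerically. The following Python sketch is illustrative only: the function names and the choice of sample times (5.1, 5.2, ..., 7.9) are assumptions, and checking sample points supports the algebraic argument rather than replacing it.

```python
from fractions import Fraction as F

def velocity(t):
    """v = t^2 - 10t + 16 (in m/s), the derivative of the displacement."""
    return t**2 - 10 * t + 16

def acceleration(t):
    """a = 2t - 10 (in m/s^2), the derivative of the velocity."""
    return 2 * t - 10

print(velocity(6), acceleration(6))  # -8 2

# Sample times strictly between 5 and 8 seconds: 5.1, 5.2, ..., 7.9.
samples = [F(5) + F(k, 10) for k in range(1, 30)]

# Velocity is negative and acceleration is positive throughout.
assert all(velocity(t) < 0 and acceleration(t) > 0 for t in samples)

# Speed is the magnitude of velocity; it decreases from sample to sample.
speeds = [abs(velocity(t)) for t in samples]
assert all(earlier > later for earlier, later in zip(speeds, speeds[1:]))
```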




(The object’s displacement, velocity and 
acceleration are shown in the graphs below. Notice 
in particular that although the velocity of the object 
is decreasing, its speed is increasing, because its 
velocity is negative.) 
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Solution to Activity 30 


The displacement s metres of the falling object at 
time t seconds is given by 
$s = -4.9t^2$.
Hence the velocity $v$ m s$^{-1}$ of the object at time t
seconds is given by
$v = \frac{ds}{dt} = -9.8t$,
and the acceleration $a$ m s$^{-2}$ of the object at time t
seconds is given by
$a = \frac{dv}{dt} = -9.8$.

So the acceleration of the object is constant, with
value $-9.8$ m s$^{-2}$.
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